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HUMAN MATRIX METALLOPROTEASE GENE, PROTEINS ENCODED THEREFROM 

AND METHODS OF USING SAME 

Background of the Invention 

1. Field of the Invention 

The present invention relates generally to therapeutic and diagnostic agents in cancer 
and inflammatory diseases. More specifically, the present invention relates to polynucleotide 
sequences encoding matrix metalloprotease proteins, as well as methods which utilize these 
1 0 sequences, which are useful for the detection, diagnosis, staging, monitoring, prognosis, 
prevention, or treatment of cancer or inflammatory diseases. 

2. Description of the Related Art 

The matrix metalloproteases (MMPs) are a multi-enzyme family capable of completely 

1 5 degrading the components of the extracellular matrix (ECM), their natural substrates (W.B. 

Ennis, and L. M. Matrisian, 7. Neuro-Oncology, 18: 105- 109 (1994)). The ECM is a 
meshwork of cells and various types of collagens and proteoglycans, collectively called 
connective tissue, which provides mechanical support and helps to maintain the structural 
integrity of tissues and organs. The function of the ECM is particularly apparent in articular 

2 0 cartilage where it provides cushioning and ease of movement between bones in joints. The 

MMPs are secreted by the cellular components of the ECM (fibroblasts, chondrocytes and 
synoviocytes) and inflammatory cells (neutrophils and macrophages) in inactive forms 
(zymogens) which are converted extracellularly to the active enzymes by various proteinases. 
Normally MMPs function in a highly regulated fashion as part of the physiological turnover of 

2 5 the ECM, effectively renewing and remodeling the ECM. However, in the clinical features of 

several diseases, the ECM is degraded and there is much evidence to support that MMPs play a 
significant pathological role in ECM degradation. 

At least fourteen members of the MMP family have been identified and most assigned 
EC numbers; several more have been discovered recently. They can be classified generally 

3 0 according to four subgroups based on substrate preference or cellular localization; e.g,, 

collagenases prefer Type I and II collagen, gelatinases prefer Type IV collagen, and 
stromelysins prefer proteoglycans (L. M. Matrisian, Bio. Essays 14: 455-463 (1992)). The 
fourth subgroup are the membrane type MMPs (muMMPs) which are characterized by the 
presence of a hydrophobic transmembrane domain near the C-terminus for anchoring the 
3 5 protein in the cell membrane. All of the other MMPs arc secreted into the extracellular milieu. 

Most of the known MMPs contain zinc in their catalytic sites and require calcium for 
activity. The major human MMPs have been cloned and exhibit greater than 50% homology. 
They contain a leader sequence for signaling their secretion by cells; a highly conserved pro- 
enzyme sequence removed upon activation; a catalytic site with a highly conserved zinc 
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binding dontiain; and a carboxy terminal region containing a conserved sequence similar to 
hemopexin, a heme binding protein. Although MMPs can be readily activated in vitro using 
mercurial compounds or trypsin, the precise mechanism for propeptide removal and activation 
of MMPs in vivo is not understood. Some MMPs can undergo an autoactivation process, 
5 while recent evidence indicates that membrane type MMPs may function as activators of other 
MMPs. 

A grouping line of evidence implicates the MMPs as important enzymes in cancer 
metastasis. Although different cancer cell lines have been shown to express various MMPs 
when grown in culture, gelatinase A in particular has been the focus of a number of recent 
1 0 studies which demonstrates its role in the invasiveness of cancer cells (W. G. Stetler- 

Slcycnson, etaL,FASEB J. 7: 1434-1441 (1993)). For example, gelatinase A is found in the 
urine of bladder cancer patients and specific monoclonal antibodies have been used to detect 
the enzyme in breast tumor sections {I.M. Margulies, et al, 1: 467-474 (1992)). 
The enzyme is expressed in an invasive prostate cancer cell line (PC-3 ML) and cells 

1 5 transfected with the gelatinase A gene arc capable of extravasation when injected into rhice (M. 

E. Stearns, and M. Wang, Oncology Res. 6: 195-201 (1994)). These studies and others 
implicate the involvement of gelatinase A in tumor metastasis, and suggest that inhibitors of 
this enzyme may offer therapeutic potential in certain forms of cancer. A broad spectrum 
matrix mctalloproteinase inhibitor has been shown to decrease the tumor burden of mice 

2 0 bearing ovarian carcinoma xcnographs (B. Davies, et ai. Cancer Res, 53: 2087-2091 

(1993)). This compound (BB-94, batimastat) is currently being evaluated in clinical trials for 
malignant ascites (S.A. Watson, etal. Cancer Research, 55: 3629-3633 (1995)); however, its 
poor bioavailability necessitates parental administration. Gelatinase A selective succinyl 
hydroxamates have been suggested as anti-cancer agents as well, yet these compounds possess 

2 5 the same peptidic backbone as batimastat (Porter, J. R.; Beeley, N. R. A.; Boyce, B. A.; 

Mason, B.; Millican, A.; Millar, K.; Leonard, J.; Morphy, J. R.; O'Gonnell, J. P.. Potent and 
selective inhibitors of gelatinase A 1. Hydroxamic acid derivatives. Bioorg. Med. Chem. Lett., 
4: 2741-2746 (1994)). More recently, an orally active, broad spectrum, MMP inhibitor (BB- 
2516, marimastat) was reported to stop progression of colorectal, ovarian, prostatic and 

3 0 pancreatic cancer (R. ?. Beckett, et ai, D.D.T.A'- 16-26 (1996)). 

Several lines of evidence indicate that the unregulated activity of MMPs is responsible 
for the joint degradation observed in rheumatoid arthritis and osteoarthritis. In these human 
arthritides, activated forms of the MMPs and their products (glycosaminoglycans and collagen 
fragments; are found in synovial fluids and joint tissues in abnormally high amounts (see E.D. 
3 5 Harris Jr., Role of collagenase in joint destruction. The Joints and Synovial Fluid, Vol., 1 , 
Sokoiogg, L., Ed., Orlando, FL, Academic Press, 1977, T.E. Cawston, et ai. Arthritis 
Rheum., 27: 285-290 (1984), Cawston, T. Ann. Rheumatic Diseases., 52: 769-770 (1993), 
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Y.L. Okada. et ai, J. Biol Chem., 261: 14245-14255 (1986), D.L. Scott, et al.Molec. 
Aspects Med., 12: 341-394 (1991), and E.D. Harris, et ai. Arthritis Rheum., 12: 92-102 
(1992)). These arthritic tissues also show a greater-than-normal expression of MMPs (S.S, 
UzOdchx^n, Arthritis and Rheumatism , 34: 1085-1093 (1991), AJ.P. Docherty and G. 
5 Murphy, Ann. Rheumat. Dis., 49: 469-479 (1990)) which is induced by cytokines and 

growth factors, also found abundantly in these tissues (D.L. Scott, et al, (1991), supra, S.M. 
Frisch and H.E. Rulcy, J. Biol. Chem,, 262: 16300-16304 (1987)). In addition, the 
activities of MMPs in normal tissues are thought to be regulated by the presence of 
endogenous tissue inhibitors of MMPs, (TIMPs, see T.E. Cawston, Curr. Med. Lit. Rheum., 
1 0 3: 1 27-0 ( 1 984)). The ratio of the amounts of TIMP and MMPs is thought to maintain a 

balance between the rates of degradation and synthesis of ECM. In tissues from rheumatoid, 
arthritics, an abnormally high expression of MMPs results in an imbalance of these enzymes 
and degradation of ECM (S.S. McCachren ( 1 99 1 ) and T.E. Cawston (1984), supra). Thus in 
arthritis, inhibition of the exacerbated degradative activities of MMPs by specific agents could 

1 5 help restore this balance. In rodent models which mimic the biochemical features of arthritis, 

there is evidence that the combined action of proteoglycan loss (due to stromelysin activity) 
and cartilage degradation (due to collagenase) are early events in this disease (R.M. Hembry, 
et aL,Am, J. Pathol, 143: 628-642 (1993), K.A. Hasty, et al. Arthritis and Rheumatism , 
33: 388-397 (1990)). Several prototype inhibitors of MMPs have been shown to reduce 

2 0 cartilage degradation in these animal models (MJ. DiMartino, et ai, J. Cell Biochem. suppl., 

19E: 179 (1991), P. Brown, et ai. Orally active inhibitors of cartilage degradation. Abst. # 
81, Abstracts of Inflammation '93, Vienna, Austria, p. 29. (1993)). 

There is much evidence to suggest that MMPs mediate the migration of inflammatory 
cells into endothelium (D., Moscaielii and D.B. Rifkin, Biochim. Biophys, Acta., 948: 67-85 

2 5 (1988), P. Zaoui, et al. Matrix metal loproteases (MMP) exocylosis from neutrophils is 

inhibited by endothelial adhesion. Abst. # 83, Abstracts of Inflammation '93i, Vienna, Austria, 
p. 29 (1993)) participating in periodontal diseases (H. Birkedal-Hansen, J. PeridontoL, 64: 
474-484 (1993)) and facilitating the growth of atherosclerotic plaques (A. M. Henny, etaL, 
Proc. Natl. Acad. ScL, 8: 8154-8158 (1991)). Recently, gelalinasc-A was reported to 

3 0 promote cleavage of the amyloid protein precursor which would suggest a role in Alzheimers 

disease for this MMP (N. Peress, et al, J. Neuropathol. Exp. Neurol, 54: 16-22 (1995), 
R.N. Lipagc, et al, FEBS Utt, 377: 267-270 (1995)). Thus, there is compelling evidence 
that MMPs play an important role in arthritis and other inflammatory diseases and t!-;it targete(l 
inhibition of these proteinases by pharmaceutical agents could have beneficial effects. 
3 5' It would be advantageous to provide specific methods and reagents for the diagnosis, 

.staging, prognosis, monitoring, prevention or treatment of diseases and conditions associated 
with imbalances in the production or activity of MMPs or to indicate possible predisposition to 
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these conditions. Such methods would include assaying a test sample for products of the 
gene. Such methods would comprise making cDNA from mRNA in the test sample, 
amplifying (when necessary) portions of the cDNA corresponding to the gene or a fragment 
thereof, and delecting the cDNA product as an indication of the presence of the cancer; or 
5 detecting translation products of the mRN As comprising gene sequences as an indication of the 
presence of the disease. These reagents include polynucleotide(s), or fragment(s) thereof 
which may be used in diagnostic methods such as reverse transcriptase-polymerase chain 
reaction (RT-PCR), PCR, or hybridization assays of biopsied tissue; or proteins which are the 
translation products of such mRNAs; or antibodies directed against these proteins. Such 
1 0 assays would include methods for assaying a sample for product(s) of the gene and detecting 
the product(s) as an indication of disease. Drug treatment or gene therapy for conditions or 
diseases associated with these detected diseases and conditions then can be based on these . 
identified gene sequences or their expressed proteins, and efficacy of any particular therapy 
can be monitored using the diagnostic methods disclosed herein. . 

1 5 Studies to understand the role of MMPs in tumor growth, metastases, or inflammatory 

conditions and diseases such as arthritis and their potential as a therapeutic or diagnostic tools 
are limited to previously described MMP proteins. It would be advantageous to identify novel 
human MMPs which may be directly involved in the physiology of, for example, tumor 
growth, extravasation, or invasion. Isolation of DNA sequences encoding human MMPs 

2 0 would permit more extensive studies on the association and regulation of individual MMPs in , 

specific cancers or tumor types, and in diseases such as arthritis or other inflammatory 
conditions. In addition, the identification of tissue-specific or disease-specific MMPs would 
provide more direct targets for therapeutics designed to attenuate these diseases. 

25 Summarv of the Invention 

This present invention provides an isolated and purified polynucleotide encoding a 
matrix metal loprotease (provisionally named MMP 1 9), polynucleotide fragments thereof, 
expression vectors containing those polynucleotides, host cells transformed with those 

3 0 expression vectors, processes for making the MMP protein using those polynucleotides and 

vectors, isolated and purified MMP protein and polypeptide fragments thereof, and antibodies 
raised to synthetic peptides derived from the MMP protein. The invention also provides 
diagnostic assays to identify the pre.sence of the MMP polynucleotide or polypeptide, assays 
used to identify agents that affect the function of the MMP polynucleotide or polypeptide, Ln^d 
3 5 the use as therapeutic agents of the MMP polynucleotide, polypeptides, or antibodies. 

The cDNA clone was obtained by screening a human cDNA expression database with a 
consensus sequence to twelve other human MMPs. The sequence of the partial cDNA isolated 
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5 

indicated that the gene product is a novel MMP protein that is expressed in a linnited number of 
tissues. 

Brief Description of the Drawings 

FIG. 1 shows the MMP consensus sequence used to query the Li feScq**'"^ database. 
The functional motifs present in all MMPs are double underlined. 

FIG. 2 shows the complete nucleotide and translated sequence (frame 3) of EST 
#907334 in LifeSeq^"^. A putative cysteine switch motif (PRCGVTD which is SEQ ID NO:8) 
and a putative furin recognition site (RKKR which is SEQ ID NO:9) are double underlined. 

FIG. 3 shows the length and position of individual sequencing reactions to generate the 
double stranded nucleotide sequence of the insert in clone #907334. The boxes at the bottom 
are graphic representations of the three forward reading frames with start codons shown as 
short upward lines and stop codons as longer downward lines. Frame 1 contains the longest 
open reading frame. 

FIG. 4 shows the doubled stranded complete nucleotide sequence (SEQ ID NO: I and 
SEQ ID N0:2, top and bottom strand respectively) of the MMP 19 gene and the translated 
amino acid sequence (SEQ ID N0:9) of MMP 19 (shown beneath in single letter codes). 
Indicated in bold type are the structural domains found in other MMPs. The cysteine switch, 
furin recognition site, and zinc binding site are double underlined. The putative poly 
adenylalion signal in the 3' untranslated region of the mRNA is single underlined. Boxes 
indicate the position of the 3* end of the primers used in the synthesis of DNA conslmcts for 
the expression of the catalytic portion of MMP 19. 

FIG. 5 shows the alignment of the polypeptide encoded by SEQ ID NO; 10 with the 
amino acid sequences of all known human MMPs, namely, matriiysin (GenBank Accession 
No. L22524, SEQ ID NO: 1 8), Gelatinase A (GenBank Accession No. M55593, SEQ ID 
NO: 19), Gelatinase B (GenBank Accession No. J05070, SEQ ID NO:20), Stromelysin 1 
(GenBank Accession No. J03209, SEQ ID N0:2I). Stromelysin 2 (GenBank Accession No. 
X07820, SEQ ID N0:22), Stromelysin 3, (GenBank Accession No. X57766. SEQ ID 
N0:23). Collagenase 1 (GenBank Accession No. M 1 3509, SEQ ID NO:24), Collagenase 2 
(GenBank Accession No. J05556. SEQ ID NO:25), Collagenase 3 (GenBank Accession No. 
X75308, SEQ ID NO:26), MMP 12 (GenBank Accession No. L23808, SEQ ID NO:27), 
MMP18 (GenBank Accession No. Y08622 or X92521, SEQ ID NO:28), mtlMMP (GenBank 
Accession No. D26512, SEQ ID NO:29), mt2MMP (GenBank Accession No. Z48482, SEQ 
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ID NO:30), mt3MMP (GenBank Accession No. D50477, SEQ ID N0:31), and mt4MMP 
(GenBank Accession No. X89576, SEQ ID NO:32). The consensus sequence was generated 
using a plurality of 5. Characteristic motifs found in MMPs are double underlined. 

5 FIG. 6 shows the alignment and tissue distibution of ESTs found in LifeSeq'^'^' which 

are 1 00% identical to regions of MM? 1 9 cDN A. 

FIG. 7 shows a Phospholmage of a Northern blot of human RNA (2|ig of poly A+ 
RNA/Iane) probed with a ^^-labeled MMPl 9 cDNA probe. 

10 

Detailed Description of the Invention 

The present invention provides isolated and purified polynucleotides that encode a 
human matrix metalloprotease, fragments thereof, expression vectors containing those 

1 5 polynucleotides, host cells transformed with those expression vectors, a process of making the 

MMP using those polynucleotides and vectors, and isolated and purified recombinant MMP 
and polypeptide fragments thereof. 

The present invention also provides for the use of MMPl 9 in developing treatments for 
any disorder mediated (directly or indirectly) by insufficient amounts or production of MMPl 9 

2 0 protein. Purified human MMP19 protein may be administered to a patient with .such a 

condition. Alternatively, gene therapy techniques for producing MMPl 9 polypeptide in vivo 
are also provided. 

The present invention also provides methods for assaying a test sample for products of 
the MMP gene, which comprises making cDN A from mRNA in the test sample, and detecting 

2 5 the cDNA as an indication of the presence of the MMP gene. The method may include an 

amplification step, wherein portions of the cDNA corresponding to the gene or fragment . 
thereof is amplified. Methods also are provided for assaying for the translation products of 
mRNAs. Test samples which may be assayed by the methods provided herein include tissues, 
cells, body fluids and secretions. The present invention also provides reagents such as 

3 0 oligonucleotide primers and polypeptides which are useful in performing these methods. 

Portions of the nucleic acid sequences disclosed herein arc useful as primers for the 
reverse transcription of RNA or for the amplification of cDNA; or as probes to determine the 
presence of certain cDNA sequences in test samples. Also disclosed are nucleic acid 
.sequences which permit the production of encoded polypeptide sequences which are useful as 
3 5 standards or reagents in diagnostic immunoassays, targets for pharmaceuiical screening assays 
and/or as components or target sites for various therapies. Monoclonal and polyclonal 
antibodies directed against at least one epitope contained within these polypeptide sequences 
are useful for diagnostic tests as well as delivery agents for therapeutic agents and for 
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screening for diseases or conditions associated with cancers, arthritis, or inflannmation. 
Isolation of sequences of other portions of the gene of interest can be accomplished by utilizing 
probes or PCR primers derived from these nucleic acid sequences, thus allowing additional 
probes and polypeptides of the genome of interest to be established, which also will be useful 
5 in the diagnosis, prognosis and/or treatment of diseases and conditions characterized by the 
MMP gene disclosed herein. 

The techniques for determining the amino acid sequence ^'similarity" are well-known in 
the art. In general, "similarity" means the exact amino acid to amino acid comparison of two 
or more polypeptides at the appropriate place, where amino acids are identical or possess 
1 0 similar chemical and/or physical properties such as charge or hydrophobicity. A so-termed 

"percent similarity" then can be determined between the compared polypeptide sequences. The 
techniques for determining nucleic acid and amino acid sequence identity also are well known 
in the art and include determining the nucleotide sequence of the mRNA for that gene (usually 
via a cDNA intermediate) and determining the amino acid sequence encoded therein, and 

1 5 comparing this to a second amino acid sequence. In general, "identity" refers to an exact 

nucleotide to nucleotide or amino acid to amino acid correspondence of two polynucleotides or 
polypeptide sequences, respectively. Two or more polynucleotide sequences can be compared 
by determining their "percent identity." Two amino acid sequences likewise can be compared 
by determining their "percent identity." The programs available in the Wisconsin Sequence 

2 0 Analysis Package (available from Genetics Computer Group, Madison, WI), for example, the 

GAP program, are capable of calculating both the identity between two polynucleotides and the 
identity and similarity between two polypeptide sequences, respectively. Other programs for 
calculating identity or similarity between sequences are known in the art. 

The compositions and methods described herein will enable the identification of certain 

2 5 markers as indicative of cancer, arthritis, or inflammation resulting from or associated with an 

MMP19 disorder; the information obtained therefrom will aid in the diagnosis, staging, 
monitoring, prognosis and/or therapy of those same di.sea.ses or conditions. Test methods , 
include, for example, probe assays which utilize the sequence(s) provided herein and which 
also may utilize nucleic acid amplification methods such as the polymerase chain reaction 

3 0 (PGR), the Iiga.se chain reaction (LCR); and hybridization. In addition, the nucleotide 

sequences provided herein contain open reading frames from which one or more immunogenic 
epitopes may be found. Such epitopes are believed to be unique to the disease state or 
condition associated with MMP 19 related cancer, arthritis, or inflammation. The uniqueness 
of any epitope may be determined by its immunological reactivity with the .specific MMP gene 
3 5 in diseased tissues and lack of immunological reactivity with non-diseased tissues. Methods 

for determining immunological reactivity are well-known and include but are not limited to, for 
example, radioimmunoassdy (RIA), enzyme-linked immunosorbent assay (ELISA), 
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hemagglutination (HA), fluorescence polarization immunoassay (FPIA); chemiiuminescent 
immunoassay (CLIA), and others; several examples of suitable methods are described herein. 
Unless otherwise stated, the following terms shall have the following meanings: 
A polynucleotide "derived from" a designated sequence refers to a polynucleotide 
5 sequence which is comprised of a sequence of approximately at least about 6 nucleotides, is 

preferably at least about 8 nucleotides, is more preferably at least about 10-12 nucleotides, and 
even more preferably is at least about 15-20 nucleotides corresponding, i.e., identical to or 
complementary to, a region of the designated nucleotide sequence. The sequence may be 
complementary to or identical to a sequence which is unique to a particular polynucleotide 
1 0 sequence as determined by techniques known in the art. Comparisions to sequences in 

databanks, for example, can be used as a method to determine the uniqueness of a designated 
sequence. Regions from which sequences may be derived include but are not limited to 
regions encoding specific epitopes, as well as non-translated and/or non-transcribed regions. 
The derived polynucleotide will not necessarily be derived physically from the 

1 5 nucleotide sequence of interest under study, but may be generated in any manner, including 

but not limited to chemical synthesis, replication, reverse transcription or transcription, which 
is based on the information provided by the sequence of bases in the region(s) from which the 
polynucleotide is derived; as such, it may represent either a sense or an.antisense orientation of 
the original polynucleotide. In addition, combinations of regions corresponding to that of the 

2 0 designated sequence may be modified in ways known in the art to be consistent with an 

intended use. 

The term "primer" denotes a specific oligonucleotide sequence complementary to a 
target nucleotide sequence and used to hybridize to the target nucleotide sequence and .serve as 
an initiation point for nucleotide polymerization catalyzed by either DNA polymerase, RN A 

2 5 polymerase or reverse transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide analog 
segment, i.e., PNA) which can be used to identify specific DNA or RNA present in samples 
bearing the complementary sequence. 

A "polypeptide" or "annino acid" sequence derived from a designated nucleic acid 

3 0 sequence refers to a polypeptide having an amino acid sequence identical to that of a 

polypeptide encoded in the sequence or a portion thereof wherein the portion consists of at 
least 3 to 5 contiguous amino acids, and more preferably. at, least 8 to 10 contiguous amino 
acids, and even more preferably 15 to 20 continguous amino acids, or which is 
immunologically identifiable with a polypeptide encoded in the sequence. 
3 5 A "recombinant polypeptide" as used herein means at least a polypeptide which by 

vinue of its origin or manipulation is not associated with all or a portion of the polypeptide 
with which it is associated in nature and/or is linked to a polypeptide other than that to which it 
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is linked in nature. A recombinant or derived polypeptide is not necessarily translated from a 
designated nucleic acid sequence. It also may be generated in any manner, including chemical 
synthesis or expression of a recombinant expression system. 

The term "synthetic peptide" as used herein means a polymeric form of amino acids of 
any length, which may be chemically synthesized by methods well-known to the routineer. 
These synthetic peptides are useful in various applications. 

The term "polynucleotide" as used herein means a polymeric form of nucleotides of 
any length, either ribonucleotides or deoxyribonucleotides/ This term refers only to the 
primary structure of the molecule. Thus, the term includes double- and single-stranded DNA, 
as well as, double- and single-stranded RNA. It also includes modifications, such as 
methylation or capping, and unmodified forms of the polynucleotide. 

"A sequence corresponding lo a cDN A" means that the sequence contains a 
polynucleotide sequence that is identical to or complementary to a sequence in the designated 
DNA. The degree (or "percent") of identity or complementarity to the cDNA will be 
approximately 50% or greater, will preferably be at least about 70% or greater, and more 
preferably will be at least about 90%. The sequence that corresponds will be at least about 50 
nucleotides in length, will preferably be about 60 nucleotides in length, and more preferably, 
will be at least about 70 nucleotides in length. The correspondence between the gene or gene 
fragment of interest and the cDNA can be determined by methods known in the art, and 
include, for example, a direct comparison of the sequenced material with the cDNAs 
described, or hybridization and digestion with single strand nucleases, followed by size 
determination of the digested fragments. 

"Purified polynucleotide" refers to a polynucleotide of interest or fragment thereof 
which is essentially free, i.e., contains less than about 50%, preferably less than about 70%, 
and more preferably, less than about 90% of the protein with which the polynucleotide is 
naturally associated. Techniques for purifying polynucleotides of interest are well-known in 
the art and include, for example, disruption of the cell containing the polynucleotide with a 
chaotropic agent and .separation of the polynucleotide(s) and proteins by ion-exchange 
chromatography, affinity chromatography and sedimentation according to density. Thus, 
"purified polypeptide" means a polypeptide of interest or fragment thereof which is essentially 
free, that is, contains less than about 50%, preferably less than about 70%, and more 
preferably, less than about 90% of cellular components with which the polypeptide of interest 
is naturally associated. Methods for purifying are known in the art. 

The term "isolated" means that the material is removed from its original environment 
(e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the .same 
polynucleotide or DNA or polypeptide, which is separated from some or all of the coexisting 
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materials in the natural system, is isolated. Such polynucleotide could be part of a vector 
and/or such polynucleotide or polypeptide could be part of a composition, and still be isolated 
in thai the vector or composition is not part of its natural environment. 

"Polypeptide" as used herein indicates a molecular chain of amino acids and does not 
5 refer to a specific length of the product. Thus, peptides, oligopeptides and proteins are 

included within the definition of polypeptide. This term, however, is not intended to refer to 
post-expression modifications of the polypeptide, for example, glycosylations, acetylations, - 
phosphorylations and the like. 

"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and other 
1 0 such terms denoting microorganisms or higher eukaryotic cell lines cultured as unicellular 

entities refer to cells which can be, or have been, used as recipients for recombinant vector or 
other transferred DNA, and include the original progeny of the original cell which has been 
transfected. 

As used herein "replicon" means any genetic element, such as a plasmid, a 

1 5 chromosome or a virus, that behaves as an autonomous unit of polynucleotide replication 

within a cell. 

A "vector" is a replicon in which another polynucleotide segment is attached, such as to 
bring about the replication and/or expression of the attached segment. 

The term "control sequence*' refers to polynucleotide sequences which are necessary to 

2 0 ^effect the expression of coding sequences to which they arc ligated. The nature of such control 

sequences differs depending upon the host organism. In prokaryotes, such control sequences 
generally include promoter, ribosomal binding site and terminators; in eukaryotes, such control 
sequences generally include promoters, terminators and, in some instances, enhancers. The 
term "control sequence" thus is intended to include at a minimum all components whose 

2 5 presence is necessary for expression, and also may include additional components whose 

presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a situation wherein the components described arc in a 
relationship permitting them to function in their intended manner. Thus, for example, a control 
sequence "operably linked" to a coding sequence is ligated in such a manner that expression of 

3 0- the coding sequence is achieved under conditions compatible with the control sequences. 

The term "open reading frame" or "ORF" refers to a region of a polynucleotide 
sequence which encodes a polypeptide; this region may represent a portion of a coding 
sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is transcribed into niRNA and 
3 5 . translated into a polypeptide when placed under the control of appropriate regulatory 

sequences. The boundaries of the coding sequence are determined by a translation start codon 
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at the 5' -terminus and a translation stop codon at the 3' -terminus. A coding sequence can 
include, but is not limited to, mRNA, cDNA, and recombinant polynucleotide sequences. 

The term "immunologically identifiable with/as" refers to the presence of epitope(s) and 
polypeptide(s) which also are present in and are unique to the designated polypeptide(s). 
Immunological identity may be determined by antibody binding and/or competition in binding. 
These techniques are known to the routineer and also are described herein. The uniqueness of 
an epitope also can be determined by computer searches of known data banks, such as 
GenBank, for the polynucleotide sequences which encode the epitope, and by amino acid 
sequence comparisons with other known proteins. 

As used herein, "epitope" means an antigenic determinant of a polypeptide. 
Conceivably, an epitope can comprise three amino acids in a spatial conformation which is 
unique to the epitope. Generally, an epitope consists of at least five such amino acids, and 
more usually, it consists of at least eight to ten amino acids. Methods of examining spatial 
conformation are known in the art and include, for example, x-ray crystallography and two- 
dimensional nuclear magnetic resonance. 

A ^^conformational epitope" is an epitope that is comprised of specific juxtaposition of 
amino acids in an immunologically recognizable structure, such amino acids being present on 
the same polypeptide in a contiguous or non-contiguous order or present on different 
polypeptides. 

A polypeptide is "immunologically reactive" with an antibody when it binds to an 
antibody due to antibody recognition of a specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody binding, more particularly by the 
kinetics of antibody binding, and/or by competition in binding using as compeiitor{s) a known 
polypeptide(s) containing an epitope against which the antibody is directed. The methods for 
determining whether a polypeptide is immunologically reactive with an antibody are known in 
the art. 

As used herein, the term "immunogenic polypeptide containing an epitope of interest" 
means naturally occurring polypeptides of interest or fragments thereof, as well as 
polypeptides prepared by other means, for example, by chemical synthesis or the expression 
of the polypeptide in a recombinant organism. 

The term "transformation" refers to the in.scrtion of an exogenous polynucleotide into a 
host cell, irrespective of the method used for the insertion. For example, direct uptake, 
transduction or f-mating are included. The exogenous polynucleotide may be maintained as 
non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the hu.M 
genome. 

"Treatment" refers to prophylaxis and/or therapy. 
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The term "individual" as used herein refers to vertebrates, particularly members of the 
mammalian species and includes but is not limited to domestic animals, sports animals, 
primates and humans; more particularly the term refers to humans. 

The term "sense strand" or "plus strand" (or "+") as used herein denotes a nucleic acid 
5 that contains the sequence that encodes the polypeptide.. The terra "antisense strand" or "minus 
strand" (or "-") denotes a nucleic acid that contains a sequence that is complementary to that of 
the "plus" strand. 

The term "test sample" refers to a component of an individual's body which is the 
source of the analytc (such as, antibodies of interest or antigens of interest). These 
1 0 components are well known in the art. These test samples inclucie biological samples which 
can be tested by the methods of the present invention described herein and include human and 
animal body fluids such as whole blood, serum, plasma, cerebrospinal fluid, urine, lymph 
fluids, and various external secretions of the respiratory, intestinal and genitorurinary tracts, 
tears, saliva, milk, white blood cells, myelomas and the like: biological fluids such as cell 

1 5 culture supernatants; fixed tissue specimens; and fixed cell specimens. 

"Purified product" refers to a preparation of the product which has been isolated from 
the cellular constituents with which the product is normally associated, and from other types of 
cells Nvhich may be present in the sample of interest. 

"PNA" denotes a "peptide nucleic acid analog" which may be utilized in a procedure 

2 0 such as an assay described herein to determine the presence of a target. "MA" denotes a 

"morphoiino analog" which may be utilized in a procedure such as an assay described herein to 
determine the presence of a target. See, for example, U.S. Patent No. 5,378,84 1 , which is 
incorporated herein by reference. PNAs are neutrally charged moieties which can be directed 
against RNA targets or DNA. PNA probes used in as.says in place of, for example, the DNA 

2 5 probes of the present invention, offer advantages not achievable when DNA probes are used. 

The.se advantages include manufacturability, large scale labeling, reproducibility, .stability, 
insensitivity to changes in ionic strength and resistance to enzymatic degradation which is 
present in methods utilizing DNA or RNA. These PNAs can be labeled with such signal 
generating compounds as fluorescein, radionucleotides, chemiiuminescent compounds, and 

3 0 the like. PNAs or other nucleic acid analogs such as MAs thus can be used in assay methods 

in place of DNA or RNA. Although assays are described herein utilizing DNA probes, it is 
within the scope of the routineer that PNAs or MAs can be substituted for RNA or DNA with 
appropriate changes if and as needed in assay reagents. 

" Analyte," as used herein, is the substance to be detected which may be present in the 
3 5 test sample. The analytc can be any substance for which there exists a naturally occurring 

specific binding member (such as, an antibody), or for which a specific binding member can 
be prepared. Thus, an analytc is a substance that can bind to one or more .specific binding 
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members in an assay. "Analyte" also includes any antigenic substances, haptens, antibodies, 
and combinations thereof. As a'mcmber of a specific binding pair, the analyte can be detected 
by means of naturally occurring specific binding partners (pairs) such as the use of intrinsic 
factor protein as a member of a specific binding pair for the determination of Vitamin B 1 2, the 
5 use of folaie-binding protein to determine folic acid, or the use of a lectin as a member of a 
specific binding pair for the determination of a carbohydrate. The analyte can include a 
protein, a peptide, an amino acid, a nucleotide target, and the like. 

An "Expressed Sequence Tag" or "EST" refers to the partial sequence of a cDNA insert 
which has been made by reverse transcription of mRNA extracted from a tissue, followed by 
1 0 insertion into a vector. 

A "transcript image" refers to a table or list giving the quantitative distribution of ESTs 
in a library, and represents the genes active in the tissue from which the library was made. 

The present invention provides assays which utilize specific binding members. A 
"specific binding member," as used herein, is a member of a specific binding pair. That is, 

1 5 two different molecules where one of the molecules through chemical or physical means 

specifically binds to the .second molecule. Therefore, in addition to antigen and antibody 
specific binding pairs of common immunoassays, other specific binding pairs can include 
biotin and avidin, carbohydrates and lectins, complementary nucleotide sequences, effector 
and receptor molecules, cofactors and enzymes, enzyme inhibitors and enzymes, and the like. 

2 0 Furthermore, specific binding pairs can include members that are analogs of the original 

specific binding members, for example, an analyte-analog. Immunoreactive specific binding 
members include antigens, antigen fragments, antibodies and antibody fragments, both 
monoclonal and polyclonal, and complexes thereof, including those formed by recombinant 
DNA molecules. 

2 5 The term "hapten," as used herein^ refers to a partial antigen or non-protein binding . 

member which is capable of binding to an antibody, but which is not capable of eliciting 
antibody formation unless coupled to a carrier protein. 

A "capture reagent," as used herein, refers to an unlabeled specific binding member 
which is specific either for the analyte as in a sandwich assay, for the indicator reagent or 

3 0 analyte as in a competitive assay, or for an ancillary specific binding member, which itself is 

specific for the analyte, as in an indirect assay. The capture reagent can be directly or 
indirectly bound to a solid phase material before the performance of the assay or during the 
performance of the assay, thereby enabling the separation of immobilized complexes from the 
test sample. 

3 5 The "indicator reagent" comprises a Vsignal-generating compound" ("label") which is 

capable of generating and generates a measurable signal detectable by external means, 
conjugated ("attached") to a specific binding member. "Specific binding member" as u.sed 
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herein means a mennber of a specific binding pair. That is, two different molecules where one 
of the molecules through chemical or physical means specifically binds to the second molecule. 
In addition to being an antibody member of a specific binding pair, the indicator reagent also 
can be a member of any specific binding pair, including either hapten-anii -hapten systems .such 
5 as biotin or anti-biotin, avidin or biotin, a carbohydrate or a lectin, a complementary nucleotide 
sequence, an effector or a receptor molecule, an enzyme cofactor and an enzyme, an enzyme 
inhibitor or an enzyme, and the like. An immunoreactive specific binding member can be an 
antibody, an antigen, or an antibody/antigen complex that is capable of binding either to 
polypeptide of interest as in a sandwich assay, to the capture reagent as in a competitive assay, 
10 or to the ancillary specific binding member as in an indirect assay. When describing probes 

and probe assays, the term "reporter molecule" may be used. A reporter molecule comprises a 
signal generating compound as described hereinabove conjugated to a specific binding member 
of a specific binding pair, such as carbazol or adamantane. 

The various "signal-generating compounds" (labels) contemplated include 

1 5 chromogens, catalysts such as enzymes, luminescent compounds such as fluorescein and 

rhodamine, chemiluminescent compounds such as dioxetanes, acridiniums, phenanthridiniums 
and luminol, radioactive elements, and direct visual labels. Examples of enzymes include 
alkaline phosphatase, horseradish peroxidase, beta-galactosidase, and the like. The selection of 
a particular label is not critical, but it will be capable of producing a signal either by itself or in 

2 0 conjunction with one or more additional substances. 

"Solid phases" ("solid supports") arc known to those in the art and include the walls of 
wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, nitrocellulose strips, 
membranes, microparticles such as latex particles, sheep (or other animal) red blood cells, and 
Duracytes® (red blood cells "fixed" by pyruvic aldehyde and formaldehyde, available from 

2 5 Abbott Laboratories, Abbott Park, IL) and others. The "solid phase" is not critical and can be 

selected by one skilled in the art. Thus, latex particles, microparticles, magnetic or non- 
magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon chips, 
sheep (or other suitable animal's) red blood cells and Duracytes® are all suitable examples. 
Suitable methods for immobilizing peptides on solid phases include ionic, hydrophobic, 

3 0 covalent interactions and the like. A "solid phase", as used herein, refers to any material 

which is insoluble, or can be made insoluble by a subsequent reaction. The solid phase can be 
chosen for its intrinsic ability to attract and immobilize the capture reagent. Alternatively, the 
solid phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent. The additional receptor can include a charged substance that is oppositely 
3 5 charged with respect to the capture reagent itself or to a charged substance conjugated to the 
capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon .(attached to) the solid phase and which has the ability to 
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immobilize the capture reagent through a sp)ecific binding reaction. The receptor molecule 
enables the indirect binding of the capture reagent to a solid phase material before the 
performance of the assay or during the performance of the assay. The solid phase thus can be 
a plastic, deriyatized plastic, magnetic or non-magnetic metal, glass or silicon surface of a test 
5 tube, microliter well, sheet, bead, microparticle, chip, sheep (or other suitable animal's) red 
blood cells, Duracytes® and other configurations known to those of ordinary skill in the art. 

It is contemplated and within the scope of the present invention that the solid phase also 
can comprise any suitable porous material with sufficient porosity to allow access by detection 
antibodies and a suitable surface affinity to bind antigens. Microporous structure generally arc 

1 0 preferred, but materials with gel structure in the hydrated state may be used as well. Such 

useful solid suppons include but are not limited to nitrocellulose and nylon. It is contemplated 
that such porous solid suppons de.scribed herein preferably are in the form of sheets of 
thickness from about 0.01 to 0.5 mm, preferably about 0. 1 mm. The pore size may vary 
within wide limits, and preferably is from about 0.025 to 15 microns, especially from about 
15 0.15 to 15 microns. The surface of such supports may be activated by chemical processes . 
which cause covalent linkage of the antigen or antibody to the support. The irreversible 
binding of the antigen or antibody is obtained, however, in general, by adsorption on the 
porous material by poorly understood hydrophobic forces. Other suitable solid supports are 
known in the art. 

2 0 Re^gpnt? . 

The present invention provides reagents such as human MMP 19 polynucleotide 
sequences derived from an MMP19 of interest, polypeptides encoded therein, and antibodies 
developed from these polypeptides. The present invention also provides reagents such as 
oligonucleotide fragments derived from the disclosed polynucleotides and nucleic acid 

2 5 sequences complementary to the these polynucleotides. The polynucleotides or polypeptides 

or antibodies of the present invention may be used in the diagnosis, prognosis, and/or 
treatment of individuals with conditions associated with the expression of the MMPl 9 gene, 
such as cancer, arthritis, or inflammation, or to identify a predisposition to these conditions. 
The sequences disclosed herein represent unique polynucleotides which can be used in assays 

3 0 or for producing a disease specific profile of gene iran.scription activity. 

Selected MMP 19 polynucleotides can be used in the methods described herein for the 
detection of normal or altered gene expression. Such methods may employ the MMP 1 9 
polynucleotides disclo.sed herein or oligonucleotides, fragments or derivatives thereof, or 
nucleic acid sequences complementary to these polynucleotides. 
3 5 The polynucleotides di.sclosed herein, their complementary sequences or fragments of 

either can be used in assays to detect, amplify or quantify genes, cDNAs or mRNAs relating to 
cancer, arthritis, or inflammation and associated conditions. They also can be used to identify 
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ixn entire or partial coding region which encodes for MMP19 polypeptide. They further can be 
provided in individual containers in the form of a kit for assays, or provided as individual 
compositions. If provided in a kit for assays, other suitable reagents such as buffers, 
conjugates and the like may be included. 
5 The polynucleotide(s) may be in the form of mRNA or DNA. Polynucleotides in the 

form of DNA, cDNA, genomic DNA, and synthetic DNA are within the scope of the pre.sent 
invention. The DNA may be double-stranded or single-stranded, and if single stranded may 
be the coding (sense) strand or non-coding (anti-sen.^e) strand. The coding sequence which 
encodes the polypeptide may be identical to the coding sequence provided herein or may be a 
1 0 different coding sequence which coding sequence, as a result of the redundancy or degeneracy 
of the genetic code, encodes the same polypeptide as the DNA provided herein. 

This polynucleotide may include only the coding sequence for the polypeptide, or the 
coding sequence for the polypeptide and additional coding sequence such as a leader or 
secretory sequence or a proprotein sequence, or the coding sequence for the polypeptide (and 

1 5 optionally additional coding sequence) and non-coding sequence, such as a non-coding 

sequence 5' and/or 3' of the coding .sequence for the polypeptide. 

In addition, the invention includes variant polynucleotides containing modifications 
such as polynucleotide deletions, substitutions or additions; and any polypeptide modification 
resulting from the variant polynucleotide sequence. A polynucleotide of the present invention 

2 0 also may have a coding sequence which is a naturally occurring allelic variant of the coding 

sequence provided herein. 

In addition, the coding sequence for the polypeptide may be fused in the same reading 
frame to a polynucleotide sequence which aids in expression and secretion of a polypeptide 
from a host cell, for example, a leader sequence which functions as a secretory sequence for 

2 5 controlling transport of a polypeptide from the cell. The polypeptide having a leader sequence 

is a preprotein and may have the leader sequence cleaved by the host cell to form the 
polypeptide. The polynucleotides may also encode for a proprotein which is the protein plus 
additional amino acid residues at the N-terminus. A protein having a prosequence is a 
proprotein and may in some cases be an inactive form of the protein. Once the prosequence is 

3 0 cleaved an active protein remains. Thus, the polynucleotide of the present invention may 

encode for a protein, or for a protein having a prosequence or for a protein having both a 
presequence (leader sequence) and a prosequence. 

The polynucleotides of the present invention may also have the coding sequence fused 
in frame to a marker sequence which allows for purification of the polypeptide of the present 
3 5 invention. The marker sequence may be a hexa-histidine tag supplied by a pQE-9 vector to 

provide for purification of the polypeptide fused to the marker in the case of a bacterial host, ^ 
or, for example, the marker sequence may be a hemagglutinin (HA) tag when a mammalian 
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host, e.g. COS-7 cells, is used. The HA lag corresponds to an epitope derived from the 
influenza hemagglutinin protein. See, for example, I. Wilson, et al., Ce]] 37:767 (1984). 

It is contemplated that polynucleotides will be considered to hybridize to the sequences 
. provided herein if there is at least 50%, and preferably at least 70% identity between the 
5 polynucleotide and the sequence. 

Probes consmicted according to the polynucleotide sequences of the present invention 
can be used in various assay methods to provide various types of analysis. For example, such 
probes can be used in Fluorescent In Situ Hybridization (FISH) technology to perform 
chromosomal analysis, and used to identify cancer-specific structural alterations in the 
1 0 chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PCR-generated and/or allele specific oiigonulcleotides probes, allele specific 
amplification or by direct sequencing. Probes also can be labeled with radioisotopes, directly- 
or indirectly- detectable haptens, or fluorescent molecules, and utilized for in situ hybridization 
studies to evaluate the mRN A expression of the gene comprising the polynucleotide in fixed 

1 5 tissue specimens or cells. 

The present invention further relates to an MMP19 polypeptide which has the deduced 
amino acid sequence as provided herein, as well as fragments, analogs and derivatives of such 
polypeptide. The polypeptide of the present invention may be a recombinant polypeptide, a 
natural purified polypeptide or a synthetic polypeptide. The fragment, derivative or analog of 

2 0 the MMP19 polypeptide may be one in which one or more of the amino acid residues is 

substituted with a conserved or non-conserved amino acid residue (preferably a conserved 
amino acid residue) and such substituted amino acid residue may or may not be one encoded 
by the genetic code; or it may be one in which one or more of the amino acid residues includes 
a substitucnt group; or it may be one in which the polypeptide is fused with another 

2 5 compound, such as a compound to increa.se the half-life of the polypeptide (for example, 

polyethy lene glycol); or it may bc.one in which the additional amino acids .are fused to the 
polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogs are within the scope of the present invention. The polypeptides and polynucleotides of 

3 0 the present invention are preferably provided in an isolated form, and preferably purifieij. 

Thus, a polypeptide of the present invention may have an amino acid sequence that is 
identical to that of the naturally occurring polypeptide or that is different by minor variations 
due to one or more amino acid substitutions. The variation may be a "conservative change" 
typically in the range of about 1 to 5 amino acids! wherein the substituted amino acid has 
3 5 similar structural or chemical properties, eg, replacement of leucine with isoleucine or 

threonine with serine. In contrast, variations may include nonconservative changes, eg, 
replacement of a glycine with a tryptophan. Similar minor variations may ai.so include amino 



wo 98/40475 



PCT/US98/04694 



18 

acid deletions or insertions, or both. Guidance in determining which and how many amino 
acid residues may be substituted, inserted or deleted without changing biological or 
immunological activity may be found using computer programs well known in the art, for 
example, DNASTAR software (DNASTAR Inc., Madison WI). 
5 Preferably, a polypeptide of the present invention is one having at least 40% identity, 

more preferably, at least 50% identity, even more preferably at least 60% identity and even 
more preferably, at least 70% identity to SEQ ID NO: 10. Such a polypeptide may also 
possess the ability to cleave at least the substrate described as SEQ ID NO: 13. 

The present invention also provides an antibody produced by using a purified MMP 19 
1 0 polypeptide of which at least a portion of the polypeptide is encoded by MMP 19 

polynucleotide selected from the polynucleotides provided herein. These antibodies may be 
used in the methods .provided herein for the detection of MMP 19 polypeptides in test samples. 
The antibody also may be used for therapeutic purposes, for example, in neutralizing the 
activity of a MMP 19 polypeptide in conditions associated with altered or abnormal expression 

1 5 of the MMP 1 9 gene. 

This invention also provides teachings as to the production of the polynucleotides and 
polypeptides provided.herein. 
Probe Assavs 

The sequences provided herein may be used to produce probes which can be used in 

2 0 a.ssays for the detection of nucleic acids in test samples. The probes may be designed from 

conserved nucleotide regions of the polynucleotides of interest or from non-conserved 
nucleotide regions of the polynucleotide of interest. The design of such probes for 
optimization in assays is within the skill of the routineer. Generally, nucleic acid probes are 
developed from non-conserved or unique regions when maximum specificity is desired, and 

2 5 nucleic acid probes are developed from conserved regions when assaying for nucleotide 

regions that are closely related to, for example, different members of a multigene family or in 
related species like moase and man. - . 

The polymerase chain reaction (PGR) is a technique for amplifying a desired nucleic 
acid sequence (target) contained in a nucleic acid or mixture thereof. In PGR, a pair of primers 

3 0 are employed in excess to hybridize at the outside ends of complementary strands of the target 

- nucleic acid. The primers are each extended by a polymerase using the target nucleic acid as a 
template. The extension products become target sequences themselves, following dissociation 
from the original target strand. New primers then are hybridized and extended by a 
polymerase, and the cycle is repeated to geometricaii y inc: jasc the number of target sequence 
3 5 molecules. PGR is disclo.sed in U.S. patents 4,683, 195 and 4,683,202, which are 
incorporated herein by reference. , 
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The Ligase Chain Reaction (LCR) is an alternate method for nucleic acid amplification. 
In LCR, probe pairs are used which include two primary (first and second) and two secondary 
(third and fourth) probes, all of which are employed in molar excess to target. The first probe 
hybridizes to a first segment of the target strand and the second probe hybridizes to a second 
5 segment of the target strand, the first and second segments being contiguous so that the 

primary probes abut one another in 5* phosphate-3'hydroxyl relationship, and so that a ligase 
can covalently fuse or ligate ihe two probes into a fused product. In addition, a third 
(secondary) probe can hybridize to a portion of the first probe and a fourth (secondary) probe 
can hybridize to a portion of the second probe in a similar abutting fashion. Of course, if the 
1 0 target is initially double stranded, the secondary probes also will hybridize to the target 

complement in the first instance. Once the ligated strand of primary probes is separated from 
the target strand, it will hybridize with the third and fourth probes which can be ligated to form 
a complementary, secondary ligated product. It is important to realize that the ligated products 
are functionally equivalent to either the target or its complement. By repeated cycles of 

1 5 hybridization and ligation, amplification of the target sequence is achieved. This technique is 

described more completely in EP-A- 320 308 to K. Backman published June 16, 1989 and 
EP-A-439 1 82 to K. Backman et al, published July 31, 1 99 1 , both of which are incorporated 
herein by reference. 

For amplification of mRNAs, it is within the scope of the present invention to reverse 

2 0 transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a 

single enzyme for both steps as described in U.S. Patent No. 5,322,770, which is 
incorporated herein by reference; or reverse transcribe mRNA into cDNA followed by 
asymmetric gap ligase chain reaction (RT-AGLCR) as described by R. L. Marshall, ct al., 
PCR Methods and Applications 4: 80-84 (1994), which also is incorporated herein by 

2 5 reference. 

Other known amplification methods which can be utilized herein include but are not 

limited to the so-called "NASBA" or '^SR" technique described in PNAS USA 87:1874-1878 
(1990) and al.so described in Nature 350 (No. 6313):9l-92 (1991); Q-heta amplification as 
described in published European Patent Application (EPA) No. 4544610; .strand displacement 

3 0 amplification (as described in G. T. Walker et ai., Clin. Chem . 42:9-13 [1996]) and European 

Patent Application No. 684315; and target mediated amplification, as described by PCT 
Publication WO 9322461. 

In one embodiment, the present invention generally comprises the steps of contacting a 
lest .sample su.spected of containing a target pblynucleotivic- .sequence with amplification 
3 5 reaction reagents comprising an amplification primer, and a detection probe that can hybridize 
with an internal region of the amplicon sequences. Probes and primers employed according to 
the method herein provided are labeled with capture and detection labels wherein probes are 
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labeled with one type of label and primers are labeled with the other type of label. 
Additionally, the primers and probes are selected such that the probe sequence has a lower melt 
temperature than the primer sequences. The amplification reagents, detection reagents and lest 
sample are placed under amplification conditions whereby, in the presence of target sequence, 
5 copies of the target sequence (an amplicon) are produced. In the usual case, the amplicon is 
double stranded because primers are provided to amplify a target sequence and its 
complementary strand. The double stranded amplicon then is thermally denatured to produce 
single stranded amplicon members. Upon formation of the single stranded amplicon 
members, the mixture is cooled to allow the formation of complexes between the probes and 

1 0 single stranded amplicon members. 

As the single stranded amplicon sequences and probe sequences are cooled, the probe 
sequences preferentially bind the single stranded amplicon members. This finding is 
counterintuitive given that the probe sequences are generally selected to be shorter than the 
primer sequences and therefore have a lower melt temperature than the primers. Accordingly, 
15 the nielt temperature of the amplicon produced by the primers should also have a higher mell 
temperature than the probes. Thus, as the mixture is cooled, the re-formation of the double 
stranded amplicon is expected although as previously stated, this is not the case. Instead, the 
probes are found to preferentially bind the single stranded amplicon members. Moreover, this 
preference of probe/single stranded amplicon binding exists even when the primer sequences 

2 0 are added in excess of the probes. 

After the probe/single stranded amplicon member hybrids are formed, they are 
detected. Standard heterogeneous assay formats are suitable for delecting the hybrids using 
the detection labels and capture labels present on the primers and probes. The hybrids can be 
bound to a solid phase reagent by virtue of the capture label and detected by virtue of the 

2 5 detection label, in cases where the detection label is directly detectable, the presence of the 

hybrids on the solid phase can be detected by causing the label to produce a detectable signal, 
if necessary, and detecting the signal. In cases where the label is not directly delectable, the 
captured hybrids can be contacted with a conjugate, which generally comprises a binding 
member attached to a directly detectable label. The conjugate becomes bound to the complexes 

3 0 and the conjugates presence on the complexes can be detected with the directly delectable label. 

Thus, the presence of the hybrids on the solid phase reagent can be determined. Tho.sc skilled 
in the art will recognize that wash steps may be employed to wash away unhybridized 
amplicon or probe as well as unbound conjugate. 

A test sample is typically anything suspected of containing a target seiquence. Test 
3 5 samples can be prepared using methodologies well known in the art such as by obtaining a 

specimen from an individual and, if necessary,. disrupting any cells contained therein to release 
target nucleic acids. Although the target sequence is described as single stranded, it also is 
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contemplated to include the case where the target sequence is actually double stranded'but is 
merely separated from its complement prior to hybridization with the amplification primer 
sequences. In the case where PCR is employed in this method, the ends of the target 
sequences are usually known. In cases where LCR or a modification thereof is employed in 
5 the preferred method, the entire target sequence is usually known. Typically, the target 
sequence is a nucleic acid sequence such as, for example, RNA or DNA. 

The method provided herein can include well known amplification reactions that utilize 
thermal cycle reaction mixtures, particularly PCR and GLCR. Amplification reactions 
typically employ primers to repeatedly generate copies of a target nucleic acid sequence, which 
I 0 target sequence is usually a small region of a much larger nucleic acid sequence. Primers are 
themselves nucleic acid sequences that are complementary to regions of a target sequence. 
Under amplification conditions, these primers hybridize or bind to the complementary regions 
of the target sequence. Copies of the target sequence typically are generated by the process of 
primer extension and/or ligation which utilizes enzymes with polymerase or ligase activity, 

1 5 . separately or in combination, to add nucleotides to the hybridized primers and/or ligate adjacent 

probe pairs. The nucleotides that are added to the primers or probes, as monomers or 
preformed oligomers, are also complementary to the target sequence. Once the primers or 
probes have been sufficiently extended and/or ligated they are separated from the target 
sequence, for example, by healing the reaction mixture to a "melt temperature" which is one in 

2 0 which complementary nucleic acid strands dissociate. Thus, a sequence complementary to the 

target sequence is formed. 

A new amplification cycle then can take place to further amplify the number of target 
sequences by .separating any double stranded sequences, allowing primers or probes to 
hybridize to their respective targets, extending and/or ligating the hybridized primers or probes 

2 5 and re-separating. The complementary sequences that are generated by amplification cycles 

can serve as templates for primer extension or filling the gap of two probes to further amplify 
the number of target sequences. Typically, a reaction mixture is cycled between 20 and 100 
times, more typically, a reaction mixture is cycled between 25 and 50 times. The numbers of 
cycles can be determined by the routineer. In this manner, multiple copies of the target 

3 0 sequence and its complementary sequence are produced. Thus, primers initiate amplification 

of the target sequence when it is present under amplification conditions. 

Generally, two primers which are complementary to a portion of a target strand and its 
complement are employed in PCR. For LCR, four probes, two of which are complementary 
to a target sequence and two of which are similariy complementary to the targets complement, 
3 5 are generally employed. In addition to the primer sets and enzymes previously mentioned, a 
nucleic acid amplification reaction mixture may also comprise other reagents which are well 
known and include but are not limited to: enzyme cofactors such as manganese; magnesium; 
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salts; nicotinamide adenine dinucieotide (NAD); and deoxynucleotide triphosphates (dNTPs) 
such as for example deoxyadenine triphosphate, deoxyguanine triphosphate, deoxycytosine 
triphosphate and deoxythyminc triphosphate. 

While the amplification primers initiate amplification of the target sequence, in some 
5 cases, the detection (or hybridization) probe is not involved in amplification. Detection probes 
are generally nucleic acid sequences or uncharged nucleic acid analogs such as, for example, 
peptide nucleic acids which are disclosed in International Patent Application WO 92/20702; 
morpholino analogs which are described in U.S. Patents Nos 5,185,444, 5,034,506, and 
5,142,047; and the like. Depending upon the type of label carried by the probe, the probe is 
1 0 employed to capture or detect the amplicon generated by the amplification reaction. The probe 
is not involved in amplification of the target sequence and therefore may have to be rendered 
"non-extendable" in that additional dNTPs cannot be added to the probe. In and of themselves 
analogs usually are non-extendable and nucleic acid probes can be rendered non-extendable by 
modifying the 3' end of the probe such that the hydroxyl group is no longer capable of 

1 5 participating in elongation. For example, the 3' end of the probe can be functionalized with the 

capture or detection label to thereby consume or otherwise block the hydroxyl group. 
Alternatively, the 3' hydroxyl group simply can be cleaved, replaced or modified. U.S. Patent 
Application Serial No. 07/049,061 filed April 19, 1993 and incorporated herein by reference 
describes modifications which can be used to render a probe non-extendable. 

2 0 Accordingly, the ratio of primers to probes is not important. Thus, either the probes or 

primers can be added to the reaction mixture in excess whereby the concentration of one would 
be greater than the concentration of the other. Alternatively, primers and probes can be 
employed in equivalent concentrations. Preferably, however, the primers are added to the 
reaction mixture in excess of the probes. Thus, primer to probe ratios of, for example, 5: 1 and 

2 5 20: 1 are preferred. 

While the length of the primers and probes can vary, the probe sequences are selected 
such that they have a lower melt temperature than the primer sequences. Hence, the primer 
sequences are generally longer than the probe sequences. Typically, the primer .sequences are 
in the range of between 20 and 50 nucleotides long, more typically in the range of between 20 

3 0 and 30 nucleotides long. The typical probe is in the range of between 10 and 25 nucleotides 

long. 

Alternatively, a probe may be involved in the amplifying a target sequence, via a 
process kno'- • "nested PCR". In nested PGR, the probe has characteristics which are 
similar to those of the first and second primers normally used for amplification (such as length, 
3 5 melting temperature etc.) and as such, may itself serve as a primer in an amplification reaction. 
Generally in nested PCR,:a first pair of primers (P] and P2) are employed to form primary 
extension products. One of the primary primers (for example, Pi) may optionally be a capture 
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primer (i.e. linked to a member of a first reactive pair), whereas the other primary primer (P2) 
is not. A secondary extension product is then formed using the P) primer and a probe (P2') 
which may also have a capture type label (such as a member of a second reactive pair) or a 
detection label at its 5' end. The probe is complimentary to and hybridizes at a site on the 
5 template near or adjacent (but not overlapping) the site where the 3* terminus of P2 would 
hybridize if it was still in solution. Thus, a labeled primer/probe set generates a secondary 
product which is shorter than the primary extension product. Furthermore, the secondary 
product may be detected either on the basis of its size or via its labeled ends (by detection 
methodologies well known to those of ordinary skill in the art). In this process, probe and 
I 0 primers are generally employed in equivalent concentrations. 

Various methods for synthesizing primers and probes are well known in the art. 
Similariy, methods for attaching labels to primers or probes are also well known in the art. 
For example, it is a matter of routine to synthesize desired nucleic acid primers or probes using 
conventional nucleotide phosphoramidite chemistry and instruments available from Applied 

1 5 Biosystems, Inc., (Foster City, CA), Dupont (Wilmington, DE), or Milligen (Bedford MA). 

Many methods have been described for labeling oligonucleotides such as the primers or probes 
of the present invention. Enzo Biochemical (New York» NY) and Clontech (Palo Alto, CA) 
both have described and commercialized probe labeling techniques. For example, a primary 
amine can be attached to a 3' oligo terminus using 3'-Amine:0N CPG'^'^ (Clontech, Palo Alto. 

2 0 CA). Similariy, a primary amine can be attached to a 5' oligo terminus using Aminomodifier 

II® (Clontech). The amines can be reacted to various haptens using conventional activation 
and linking chemistries. In addition, copending applications US. Serial Nos. 625,566, filed 
December 1 1, 1990 and 630,908, filed December 20, 1990, which are each incorporated 
herein by reference, teach methods for labeling probes at their 5* and 3' termini, respectively. 

2 5 Publications WO92/I0505, published 25 June !992 and WO 92/1 1388 published 9 July 1992 

teach methods for labeling probes at their 5' and 3* ends, respectively. According to one 
known method for labeling an oligonucleotide, a label-phosphoramidite reagent is prepared 
and used to add the label to the oligonucleotide during its synthesis. See, for example, N.T. 
Thuong et al., Tet. Letters 29(46):5905-5908 (1988); or J. S. Cohen el al., published U.S. 

3 0 Patent Application 07/246,688 (NTIS ORDER No. PAT-APPL-7-246,688) (1989). 

Preferably, probes are labeled at their 3' and 5* ends. 

Capture labels are carried by the primers or probes and can be a specific binding 
member which forms a binding pair with the ^ lid phase reagent's specific binding member. It 
will be understood, of course that the primer v: probe itself may serve as the capture label. 
3 5 For example, in the case where a solid phase reagent's binding member is a nucleic acid 

sequence, it may be selected such that it binds a complementary portion of the primer or probe 
to thereby immobilize the primer or probe to the solid phase. In cases where th6 probe itself 
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serves as the binding member, those skilled in the art will recognize that the probe will contain 
a sequence or "tail" that is not complementary to the single stranded amplicon members. In the 
case where the primer itself serves as the capture label, at least a portion of the primer will be 
free to hybridize with a nucleic acid on a solid phase because the probe is selected such that it 
5 is not fully complementary to the primer sequence. 

Generally, probe/single stranded amplicon member complexes can be detected using 
techniques commonly employed to perform heterogeneous immunoassays. Preferably, in this 
embodiment, detection is performed according to the protocols used by the commercially 
available Abbott LCx® instrumentation (Abbott Laboratories, Abbott Park, IL). 
1 0 The primers and probes disclosed herein are useful in, typical PCR assays, wherein the 

test sample is contacted with a pair of primers, amplification is performed, the hybridization 
probe is added, and detection is performed. 

Another method provided by the present invention comprises contacting a lest sample 
vvith a plurality of polynucleotides wherein at least one polynucleotide is provided herein, 

1 5 hybridizing the test sample with the plurality of polynucleotides and detecting the hybridization 

complexes. The hybridization complexes are identified and quantitaied to compile a profile 
which is indicative of cancer, metasta.ses, arthritis or inflammatory conditions. Expressed 
RNA sequences may further be detected by reverse transcription and amplification of the DNA 
product by pnxedures well-known in the art, including polymerase chain reaction (PCR). 

2 0 Antisense an d Gene Theranv. 

The present invention also encompasses the use of gene therapy methods for the 
introduction of anti-sense MMP19 gene derived molecules such as polynucleotides or 
oligonucleotides of the present invention into patients having conditions (such as cancer, 
arthritis or inflammation) that are associated with bnormal expression of MMP19 

2 5 polynucleotides. These molecules, including antisense RNA and-DNA fragments and 

ribozymcs, are designed to inhibit the translation of a MMP19 polynucleotide mRNA, and may 
be used therapeutically in the treatment of conditions aissociated with altered or abnormal 
expression of the MMP 19 polynucleotide. 

Alternatively, the oligonucleotides de.scribed above can be delivered to cells by 

3 0 procedures in the art such that the anti-sense RNA or DNA is subsequently expressed in vivo 

whereby it inhibits production of MMP 19 polypeptide in the manner described above. 
Antisense constructs to MMP 19 polynucleotide, therefore, reverse the action of MMP 19 
transcripts and may be used for treating cancer, arthritis, inflammation and related disease 
Slates. These antisense constructs may also be used to treat tumor metastases. 
3 5 Antisense technology can be used to control gene expression through triple-helix 

formation or antisense DNA or RNA, both of which methods are based on binding of a 
polynucleotide to DNA or RNA. For example, the 5* coding portion of the polynucleotide 
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sequence, which encodes for the polypeptide of the present invention, is used to design an 
antisense RNA oligonucleotide of from 10 to 40 base pairs in length. A DNA oligonucleotide 
is designed to be connplennentary to a region of the gene involved in transcription, thereby 
preventing transcription and the production of MMP19 polypeptide. For triple helix, see. for 
5 example. Lee cl ai, Nucl. Acids Res . 6:3073 ( 1 979); Cooney et al. Science 24 1 :456 ( 1 988); 
and Dervan et al. Science 251:1360 (1991) The antisense RNA oligonucleotide hybridizes to 
the mRNA in vivo and blocks translation of an mRNA molecule into the MMP 19 polypeptide. 
For antisense, see, for example, Okano, J. Neurochem . 56:560 (1991); and 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression", CRC Press, Boca 
1 0 Raton, Fla. (1988). Antisense oligonucleotides act with greater efficacy when modified to 
contain artificial iniemucleotide linkages which render the molecule resistant to nucleolyiic 
' cleavage. Such artificial intemucleotide linkages include but are not limited to 
methylphosphonate, phosphorothiolate and phosphoroamydate intemucleotide linkages. 
The present invention also encompasses gene therapy whereby the gene encoding 

1 5 MMP 19 is regulated in a patient. Various methods of transferring or delivering DNA to cells 

for expression of the gene product protein, otherwise referred to as gene therapy, are disclosed 
in "Gene Transfer into Mammalian Somatic Cells in vivo'\ N. Yang, Grit. Rev. Biotechp. 
12(4): 335-356 (1992). which is hereby incorporated by reference. Gene therapy 
encompasses incorporation of DNA sequences into somatic cells or germ line cells for use in 

2 0 either ex vivo or in vivo therapy. Gene therapy functions to replace genes, augment normal or 

abnormal gene function, and to combat infectious di.seases and other pathologies. 

Strategies for treating medical problems with gene therapy include therapeutic strategies 
such as identifying the defective gene and then adding a functional gene to either replace the 
function of the defective gene or to augment a slightly functional gene; or prophylactic 

2 5 strategies, such as adding a gene which encodes a protein product that will treat the condition 

or that will make the tissue or organ more susceptible to a treatment regimen. 

Many protocols for transfer of MMP 19 DNA or MMPl 9 regulatory sequences arc 
envisioned in this invention. Transfection of promoter sequences, other than one specifically 
associated with MMP19, or other sequences which would increa.se production of MMP19 

3 0 protein are also envisioned as methods of gene therapy. An example of this technology is 

found in Transkaryotic Therapies. Inc.. of Cambridge, Massachusetts, using homologous 
recombination to insert a "genetic switch" that turns on an erythropoietin gene in cells. See 
Genetic En^ineerint? News . April 15. 1994. Such "genetic switches" could be used to activate 
MMP 19 (or a MMP 19 receptor) in cells not normally expressing these proteins, 
3 5 Gene transfer methods for gene therapy fall into three broad categories: ( 1 ) physical 

(e.g., electroporation, direct gene transfer and particle bombardment), (2) chemical (e.g. lipid- 
based carriers and other non-viral vectors) and (3) biological (e.g. virus derived vectors). For 
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example, non-viral vectors such as liposomes coated with DNA may be directly injected 
intravenously into the patient. It is believed that the liposome/DNA complexes are 
concentrated in the liver where they deliver the DNA to macrophages and Kupffer cells. 
Additionally, vectors or the "naked" DNA of the gene may be directly injected into the desired 
5 organ, tissue or tumor for targeted delivery of the therapeutic DNA. 

Gene therapy melhodologies can also be described by delivery site. Fundamental ways 
to deliver genes include ex vivo gene transfer, in vivo gene transfer, and in vitro gene transfer. 
In ex vivo gene transfer, cells are taken from the patient and grown in cell culture. The DNA 
is transfected into the cells, the transfected cells are expanded in number and then reimplanted 
1 0 in the patient. In in vitro gene transfer, the transformed cells are cells growing in culture, such 
as tissue culture cells, and not particular cells from a particular patient. These "laboratory 
cells" are transfected, the transfected cells are selected and expanded for either implantation 
into a patient or for other uses. In vivo gene transfer involves introducing the DNA into the 
cells of the patient when the cells are within the patient. All three of the broad based categories 

1 5 de.scribed above may be used to achieve gene transfer in vivo, ex vivo, and in vitro. 

Mechanical (i.e. physical) methods of DNA delivery can be achieved by direct injection 
of DNA, such as microinjection of DNA into germ or somatic cells, pneumatically delivered 
DNA-coated particles, such as the gold particles used in a "gene gun," and inorganic chemical 
approaches such as calcium phosphate transfection. It has been found that physical injection 

2 0 of plasmid DNA into muscle cells yields a high percentage of cells which arc transfected and 

have a sustained expression of marker genes. The plasmid DNA may or may not integrate into 
the genome of the cells". Non-integration of the transfected DNA would allow the transfection 
and expression of gene product proteins in terminally differentiated, non-proliferativc tissues 
for a prolonged period of time without fear of mutational insertions, deletions, or alterations in 

2 5 the cellular or mitochondrial genome. Long-term, but not necessarily permanent,. transfer of 

therapeutic genes into specific cells may provide treatments for genetic diseases or for. . 
prophylactic use. The DNA could be reinjected periodically to maintain the gene product level 
without mutations occurring in the genomes of the recipient cells. Non-integration of 
exogenous DN As may allow for the presence of several different exogenous DNA constructs 

3 0 within one cell with all of the constructs expressing various gene products. 

Particle-mediated gene transfer may also be employed for injecting DNA into cells, 
tissues and organs. With a panicle bombardment device, or "gene gun," a motive force is . 
generated to accelerate DN A-coated high density particles (such gold or tungsten) to a high 
velocity that allows penetration of the target organs, tissues or cells. Electroporation lor gene 
3 5 transfer uses an electrical current to make cells or tissues susceptible to electroporation- 

mediated gene transfer. A brief electric impulse with a given field strength is used to increase 
the permeability of a membrane in such a way that DNA molecules can penetrate into the cells. 
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The techniques of particle-mediated gene transfer and electroporation are well known to those 
of ordinary skill in the art. 

Chemical methods of gene therapy involve carrier mediated gene transfer through the 
use of fusogenic lipid vesicles such as liposomes or other vesicles for membrane fusion. A 
carrier harboring a DNA of interest can be conveniently introduced into body fluids or the 
bloodstream and then site specifically directed to the target organ or tissue in the body. 
Liposomes, for example, can be developed which are cell specific or organ specific. The 
foreign DNA carried by the liposome thus will be taken up by those specific cells. Injection of 
immunoliposomes that are targeted to a specific receptor on certain cells can be used as a 
convenient method of inserting the DNA into the cells bearing the receptor. Another carrier 
system that has been used is the asialoglycoprotein/polylysine conjugate system for carrying 
DNA to hepatocytes for in vivo gene transfer. 

Transfected DNA may also be complexed with other kinds of carriers so that the DNA 
is carried to the recipient cell and then resides in the cyiopla.sm or in the nucleoplasm of the 
recipient cell. DNA can be coupled to carrier nuclear proteins in specifically engineered vesicle 
complexes and carried directly into the nucleus. 

Carrier mediated gene transfer may also involve the use of lipid-based proteins which 
are not liposomes. For example, lipofectins and cytofectins are lipid-based positive ions that 
bind to negatively charged DNA, forming a complex that can ferry the DNA across a cell 
membrane. Another method of carrier mediated gene transfer involves receptor-based 
endocytosis. In this method, a ligand (specific to a ceil surface receptor) is made to form a 
complex with a gene of interest and then injected into the bloodstream; target cells that have the 
ceil surface receptor will specifically bind the ligand and transport the ligand-DNA complex 
into the cell. 

Biological gene therapy methodologies usually employ viral vectors to insert genes into 
cells. The term "vector" as used herein in the context of biological gene therapy means a 
carrier that can contain or associate with specific polynucleotide sequences and which 
functions to transport the specific polynucleotide sequences into a cell. The transfected cells 
may be cells derived from the patient's normal tissue, the patient's diseased tissue, or may be 
non-patient cells. Examples of vectors include plasmids and infective microorganisms such as 
viruses, or non-viral vectors such as the ligand-DNA conjugates, liposomes, and lipid-DNA 
complexes discussed above. 

It may be desirable that a recombinant DNA molecule comprising a MMP19 DNA 
sequence is operatively linked to an expression control sequence to form an expression vector 
capable of expressing MMP19. Alternatively, gene regulation of MM? 1 9 may be 
accomplished by administering protejns that bind to control regions associated with the 
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MMP19 gene, or its corresponding RNA transcript to modify the rate of transcription or 
translation. 

Viral vectors that have been used for gene therapy protocols include but are not limited 
to, retroviruses, other RNA viruses such as poliovirus or Sindbis virus, adenovirus, adeno- 
5 associated virus, herpes viruses, SV 40, vaccinia and other DNA viruses. Replication- 
defective murine retroviral vectors are the most widely utilized gene transfer vectors. Murine 
leukemia retroviruses are composed of a single strand RNA complexed with a nuclear core 
protein and polymerase (pel) enzymes, encased by a protein core (gag) and surrounded by a 
glycoprotein envelope (env) that determines host range. The genomic structure of retroviruses 

1 0 include the gag, pol, and env genes enclosed at by 5' and 3' long terminal repeats (LTR). 

Retroviral vector systems exploit the fact that a minimal vector containing the 5' and 3' LTRs 
and the packaging signal are sufficient to allow vector packaging, infection and integration into 
target cells providing that the viral structural proteins are supplied in trans in the packaging cell 
line. Fundamental advantages of retroviral vectors for gene transfer include efficient infection 
15 • and gene expression in most cell types, precise single copy vector integration into target cell 
chromosomal DNA, and ease of manipulation of the retroviral genome. For example, altered 
retrovirus vectors have been used in ex vivo methods to introduce genes into peripheral and 
tumor-infiltrating lymphocytes, hepatocytes, epidermal cells, myocytes, or other somatic cells 
(which may then introduced into the patient to provide the gene product from the inserted 

2 0 DNA). 

The adenovirus is composed of linear, double stranded DNA complexed with core 
proteins and surrounded with capsid proteins. Advance.s in molecular virology have led to the 
ability to exploit the biology of these organisms to create vectors capable of transducing novel 
genetic sequences into target cells in vivo . Adenoviral-based vectors will express gene 

2 5 product peptides at high levels. Adenoviral vectors have high efficiencies of infectivity, even 

with low titers of virus. Additionally, the virus is fully infective as a cell free virion so 
injection of producer cell lines are not necessary. Another potential advantage to adenoviral 
vectors is the ability to achieve long term expression of heterologous genes in vivo. 

Viral vectors have also been used to insert genes into cells using in vivo protocols. To 

3 0 direct tissue-specific expression of foreign genes, cis-acting regulatory elements or promoters 

that are known to be tissue specific can be used. Alternatively, this can be achieved using in 
situ delivery of DNA or viral vectors to specific anatomical sites in vivo. For example, gene 
transfer to blood vessels in vivo was achieved by implanting in vitro transduced endothelial 
cells in chosen sites on arterial walls. The virus infected rounding cells which also 
3 5 expressed the gene product. A viral vector can be delivered directly to the in vivo site, by a 
catheter for example, thus allowing only certain areas to be infected by the virus, and 
providing long-term, site specific gene expression. In vivo gene transfer using retrovirus 
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vectors has also been demonstrated in mammary tissue and hepatic tissue by injection of the 

altered virus into blood vessels leading to the organs. \ 

Drug Screening . 

The present invention also provides a method of screening a plurality of compounds 
5 for specific binding to a MMP19 polypeptide, or any fragment thereof, to identify at least one 
compound which specifically binds the MMP19 polypeptide. Such a method comprises the 
steps of providing at least one compound; combining the MMP 19 polypeptide with each 
compound under suitable conditions for a time sufficient to allow binding; and detecting 
MMP 1 9 polypeptide binding to each compound. 
1 0 The present invention also provides a method of screening a plurality of compounds 

for inhibition of the activity of a MMP 19 polypeptide, or. any fragment thereof, to identify at 
least one compound which specifically inhibits the activity of the MMP19 polypeptide. Such a 
method comprises the steps of providing at least one compound; combining the MMP 19 
polypeptide with each compound under suitable conditions for a time sufficient to allow • 

1 5 interaction or binding; and detecting MMP 19 polypeptide inhibition by each compound. 

The polypeptide or peptide fragment employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids which can express the polypeptide or peptide 

2 0 fragment. Drugs may be screened against such transformed cells in competitive binding or 

enzymatic inhibition assays. For example, the formation of complexes between a polypeptide 
and the agent being tested can be measured in either viable or fixed cells. 

The present invention thus provides methods of screening for drugs or any other agent 
which can be used to treat conditions associated with cancer, arthritis, or inflammation 

2 5 resulting from abnormal MMP 19 production. These methods comprise contacting the drug 

with a polypeptide or fragment thereof and assaying for either the presence of a complex 
between the agent and the polypeptide, or for the presence of a complex between the 
polypeptide and the cell. In competitive binding assays, the polypeptide typically is labeled. 
After suitable incubation, free (or uncomplexed) polypeptide or fragment thereof is separated 

3 0 from that present in bound form, and the amount of free or uncomplexed label is a measure of 

the ability of the particular drug to bind to polypeptide or to interfere with the poiypeptide/cell 
complex. 

The present invention also encompasses the use of competitive drug screening assays 
in which neutralizing antibodies capable of binding polypeptide specifically compete with a test 
3 5 drug for binding to the polypeptide or fragment thereof In this manner, the antibodies can be 
used to detect the presence of any polypeptide in the test sample which shares one or more 
antigenic determinants with a polypeptide provided herein. 
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Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to at least one polypeptide disclosed herein. 
Briefly, large numbers of different small peptide test compounds are synthesized on a solid 
phase, such as plastic pins or some other surface. The peptide lest compounds are reacted 
5 with polypeptide and washed. Polypeptide thus bound to the solid phase is delected by 

methods well-known in the art. Purified polypeptide can also be coaled directly onto plates for 
use in the drug screening techniques described herein. In addition, non-neulraiizing antibodies 
can be used to capture the polypeptide and immobilize it on the solid support. See, for 
example, EP 84/03564, published on September 13, 1984, which is incorporated herein by 
1 0 reference. 

Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active 
polypeptides of interest or of the small molecules including agonists, antagonists, or inhibitors 
with which they interact. Such structural analogs can be used to fashion drugs which are more 

1 5 active or stable forms of the polypeptide or which enhance or interfere with the function of a 

polypeptide in vivo, J. Hodgson, Bio/Technologv 9: 19-21 ( 1 99 1 ), incorporated herein by 
reference. 

For example, in one approach, the three-dimensional structure of a polypeptide, or of a 
polypeptide-inhibiior complex, is determined by x-ray crystallography, by computer modeling 

2 0 or, most typically, by a combination of the two approaches. Both the shape and charges of the 

polypeptide must be ascertained to elucidate the structure and to determine active site(s) of the 
molecule. Less often, useful information regarding the structure of a polypeptide may be 
gained by modeling based on the structure of homologous proteins. In both cases, relevant 
structural information is used to design analogous poiypeptide-like molecules or to identify 

2 5 efficient inhibitors 

Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by S. Braxton et al.. BjQchernistrv 3 1 :7796-7801 (1992), or 
which act as inhibitors, agonists, or antagonists of native peptides as shown by S. B. P. 
Athauda et al.. J Biochem. fTokvo) 1 13 (6):742-746 (1993), incorporated herein by reference. 

3 0 It also is possible to isolate a target-specific antibody, selected by an assay as described 

hereinabove, and then to determine its crystal structure. In principle this approach yields a 
pharmacophore upon which subsequent drug design can be based. It further is po.ssiblc to 
bypass protein crystallography altogether by generating anti-idiotypic antibodies ("anti-ids") to 
a functional, pharmacologically active antibody. As a mirror image of a mirror image, the 
3 5 binding site of the anti-id is an analog of the original receptor. The anti-id then could be used 
to identify and isolate peptides from banks of chemically or biologically produced peptides. 
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The isolated peptides then can act as the pharmacophore (that is, a prototype pharmaceutical 
drug). 

A sufficient amount of a recombinant polypeptide of the present invention may be made 
available to perform analytical studies such as X-ray crystallography. In addition, knowledge 
5 of the polypeptide amino acid sequence which are derivable from the nucleic acid sequence 
provided herein will provide guidance to those employing computer modeling techniques in 
place of or in addition to x-ray crystallography. 

Antibodies specific to the MMP19 polypepeptide may further be used to inhibit the 
biological action of the polypeptide by binding to the polypeptide. In this manner, the 
■ 1 0 antibodies may be used in therapy, for example, to treat cancer and its metastases. 

Further, such antibodies can detect the presence or absence of the MMP19 polypeptide 
and, therefore, are useful as diagnostic markers for the diagnosis of cancers, arthritis, and 
inflammatory conditions. Such antibodies may also function as a diagnostic marker for these 
conditions. The present invention also is directed to antagonists and inhibitors of the 

1 5 polypeptides of the present invention. The antagonists and inhibitors are tho.se which inhibit 

or eliminate the function of the polypeptide. Thus, for example, an antagonist may bind to a 
polypeptide of the present invention and inhibit or eliminate, its function. The antagonist, for 
example, could be an antibody against the polypeptide which eliminates the activity of the 
MMP 19 polypeptide by binding to the MM? 19 polypeptide, or in some cases the antagonist 

2 0 may be an oligonucleotide. Examples of small molecule inhibitors include but are not limited 

to small peptides or peptide-like molecules. 

The antagonists and inhibitors may be employed as a composition with a 
pharmaceutical ly acceptable carrier, including but not limited to saline, buffered saline, 
dextrose, water, glycerol, ethanol and combinations thereof. Administration of MMP 19 

2 5 polypeptide inhibitors arc preferably systemic. The present invention also provides an 

antibody which inhibits the action of such polypeptide. 
Recombinant Technplpgy. 

The present invention provides host cells and expression vectors comprising 
polynucleotides of the present invention and methods for the production of polypeptides they 
^3 0 encode. Such methods comprise culturing the host cells under conditions suitable for the 

expression of the MMP 19 polynucleotide and recovering the MMP 19 polypeptide from the cell 
culture. 

The present invention also provides vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the present 

3 5 invention and the production of polypeptides of the present invention by recombinant 
^ techniques. 
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Host cells are genetically engineered (transduced or transformed or transfected) with 
the vectors of this invention which may be a cloning vector or an expression vector. The 
vector may be in the form of a plasmid, a viral panicle, a phage, etc. The engineered host cells 
can be cultured in conventional nutrient media modified as appropriate for activating 
5 promoters, selecting transformants or amplifying the MMP19 genes. The culture conditions, 
such as temperature, pH and the like, arc those previously used with the host cell selected for 
expression, and will be apparent to the ordinarily skilled artisan. 

The polynucleotide of the present invention may be employed for producing a 
polypeptide by recombinant techniques. Thus, the polynucleotide sequence may be included 
10. in any one of a variety of expression vehicles, in particular vectors or plasmids for expressing 
a polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other plasmid or vector may be 

1 5 used so long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into appropriate restriction endonuclease 
sites by procedures known in the art. Such procedures and others are deemed to be within the 
scope of those skilled in the art. The DNA sequence in the expression vector is operatively 

2 0 linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. 

Representative examples of such promoters include but are not limited to LTR or SV40 
promoter, the E. coli lac or irp, the phage lambda P sub L promoter and other promoters 
known to control expression of genes in prokaryotic or cukaryotic cells or their viruses. The 
expression vector also contains a ribosome binding site for translation initiation and a 

2 5 transcription terminator. The vector may also include appropriate sequences for amplifying 

expression, in addition, the expression vectors preferably contain a gene to provide a 
phenotypic trail for selection of transformed host cells such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance 
in E. coli. 

3 0 The vector containing the appropriate DNA sequence as hereinabove described, as well 

as an appropriate promoter or control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. As representative examples of appropriate hosts, 
there m: be mentioned: bacterial cells, such as E. coli . Salmonella tvphimurium: 
Strepton;vces sp. : fungal cells, such as yeast; insect cells such as Drosophila and Sf9; animal 
3 5 cells such as CHO, COS or Bowes melanoma; plant cells, etc. The selection of an appropriate 
host is deemed to be within the scope of those skilled in the an from the teachings provided 
herein. 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
5 embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters 
are known to those of skill in the art, and are commercially available. The following vectors 
are provided by way of example. Bacterial: pSPORTl (Life Technologies, Gaithersburg, 
MD), pQE70, pQE60, pQE-9 (Qiagen) pBs, phagescript, psiX174, pBluescript SK, pBsKS, 
1 0 pNHSa, pNH.16a, pNHlSa, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cal, pOG44. pXTl i pSG 
(Stratagene) pSVK3, pBPV. pMSG, pSVL (Pharmacia). However, any other plasmid or. 
vector may be used as long as it is replicable and viable in the host. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

1 5 transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 

pKK232-8 and pCM7. Particular named bacteria] promoters include lad, lacZ, T3, SP6, T7, 
gpt, lambda P sub R, P sub L and trp. Eukaryotic promoters include cytomegalovirus (CMV) 
immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, LTRs 
from retrovirus, and mouse metallothionein-L Selection of the appropriate vector and 

2 0 promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention provides host cells containing the 
above-described construct. The host cell can be a higher eukaryotic cell, such as a mammalian 
cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the construct into the host cell can be effected by 

2 5 calcium phosphate iransfection, DEAE-Dextran mediated transfection, or electroporation (L. 

Davis et al., "Basic Methods in-Molecuiar Biology", 2nd edition, Appleton and Lang, 
Paramount Publishing. East Norwalk, CT (1994). 

The constructs in host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. Alternatively, the polypeptides of the invention 

3 0 can be synthetically produced by conventional peptide synthesizers. 

Proteins can be expressed in mammalian cells, yeast,' bacteria, or other cells' under the 
control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
3 5 described by Sambrook et al., Molecular Cloning: A Laboratorv Manual . Second Edition, 
.(Cold Spring Harbor, N.Y., 1989), which is hereby incorporated by reference. 
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Transcription of a DNA encoding ihe polypeptides of the present invention by higher 
eukaryoies is increased by inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DNA, usually about from 10 to 300 bp. that act on a promoter to increase its 
transcription. Examples include the S V40 enhancer on the late side of the replication origin 
5 (bp 100 to 270), a cytomegalovirus early promoter enhancer, a polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRPl gene, and a promoter derived from a highly-expressed 
1 0 gene to direct transcription of a downstream structural sequence. Such promoters can be. 

derived from operons encoding glycolytic enzymes such as S.-phosphoglycerate kinase (PGK), 
alpha factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and termination 
sequences,, and preferably, a leader sequence capable of directing secretion of translated 

1 5 protein into the periplasmic space or extracellular medium. Optionally, the heterologous 

sequence can encode a fusion protein including an N-terminal identification peptide imparting 
desired characteristics, e.g., stabilization or simplified purification of expressed recombinant 
product. 

Useful expression vectors for bacterial use are constructed by inserting a structural 

2 0 DNA sequence encoding a desired protein together with suitable translation initiation and 

termination signals in operable reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to ensure 
maintenance of the vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli . Bacillus subtilis . Salmonella tvphimurium 

2 5 and various species within the genera Pseudomonas . Strcptomvccs . and Staphvlococcus . 

although others may also be employed as a routine matter of choice. 

Useful expression vectors for bacterial use comprise a selectable marker and bacterial 
origin of replication derived from plasmids comprising genetic elements of the well-known 
cloning vector pBR322 (ATCC 37017). Other vectors include but are not limited to PKK223- 

3 0 3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEMl (Promega Biotec, Madison, Wl). 

These pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to an 
appropriate cell density, the selected promoter is derepressed by appropriate means (e.g., 
3 5 temperature shift or chemical induction), and cells are cultured for an additional period. Cells 
are typically harvested by centrifugation, disrupted by physical or chemical means, and the 
resulting crude extract retained for further purification. Microbial cells employed in expression 
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of proteins can be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents; such methods are well-known 
to the ordinary artisan. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey 
kidney fibroblasts described by Gluzman, C^l 23:175 (1981), and other cell lines capable of 
expressing a compatible vector, such as the C]27, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor 
and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed 
sequences. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, 
early promoter, enhancer, splice, and polyadenylation sites may be used to provide the 
required nontranscribed genetic elements. Representative, useful vectors include pRc/CMV " 
and pcDNA3 (available from Invitrogen, San Diego, CA). 

MMPI9 polypeptide is recovered and purified from recombinant cell cultures by 
known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, hydroxyapatite chromatography or lectin chromatography. It is preferred to 
have low concentrations (approximately 0.1-5 mM) of calcium ion present during purification 
(Price, et al., J. Biol. Chem. 244:917 [1969]). Protein refolding steps can be used, as 
necessary, in completing configuration of the protein. Finally, high performance liquid 
chromatography (HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be naturally purified products expressed 
from a high expressing cell line, or a product of chemical synthetic procedures, or produced by 
recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial, 
yeast, higher plant, insect and mammalian cells in culture). Depending upon the host 
employed in a recombinant production procedure, the polypeptides of the present invention 
may be glycosylated with mammalian or other eukaryotic carbohydrates or may be non- 
glycosylated. The polypeptides of the invention may also include an initial methionine amino 
acid residue. 

The starting plasmids can be constructed from available plasmids in accord with 
published, known procedures. In addition, equivalent plasmids to those described are known 
in the art and will be apparent to the ordinarily skilled artisan. 

i he following is the general procedure for the isolation and analysis of cDNA clones. 
In a particular embodiment disclosed herein, an EST homologous to a portion of human 
MMPs was identified by comparison with a consensus sequence to human MMPs. 
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The cDNA insen was sequenced in entirety, analyzed in detail set forth in the Examples 
. and is disclosed in the Sequence Listing as SEQ NO: 4. These polynucleotides may contain an 
entire open reading frame with or without associated regulatory sequences for a particular 
gene, or they may encode only a portion of the gene of interest. This is attributed to the fact 
5 that many genes are several hundred, and sometimes several thousand, bases in length and, 
with current technology, cannot be cloned in their entirety because of vector limitations, . 
incomplete reverse transcription of the first strand, or incomplete replication of the second 
strand. Contiguous, secondary clones containing additional nucleotide sequence may be 
obtained using a variety of methods known to those of skill in the art. 

1 0 Methods for DNA sequencing are well known in the art. Conventional enzymatic 

methods employ DNA polymerase, Klenow fragment, Sequenase (US Biochemical Corp, 
Cleveland, OH) or Taq polymerase to extend DNA chains from an oligonucleotide primer 
annealed to the DNA template of interest. Methods have been developed for the use of both 
single-stranded and double-stranded templates. The chain termination reaction products may 
15.' be elecu-ophorescd on urea/polyacrylamide gels and detected either by autoradiography (for 
radionucleotide labeled precursors) or by fluorescence (for fluorescent-labeled precursors). 
Recent improvements in mechanized reaction preparation, sequencing and analysis using the 
fluorescent detection method have permitted expansion in the number of sequences that can be 
determined per day using machines such as the Applied Biosystems 377 DNA Sequencers 
Applied Biosystems, Foster City, CA). 

The reading frame of the nucleotide sequence can be ascertained by several types of 
lyses. First, reading frames contained within the coding sequence can be analyzed for the 
•nee of start codon ATG and stop codons TGA, TAA or TAG. Typically, one reading • 
will continue throughout the major portion of a cDNA sequence while the other two 

2 5 ■ frames tend to contain numerous stop codons. In such cases reading frame 

o ■ nation is straightforward. In pther more difficult cases, further analysis is required. 

igorithms have been created to analyze the occurrence of individual nucleotide bases 
at eac, ative codon triplet. See, for example J. W. Fickett, Nuc Acids Res 10:5303 
( 1 982). ling DNA for particular organisms (bacteria, plants, and animals) tends to contain 

3 0 certain nucleotides within certain triplet periodicities, such as a significant preference for 

pyrimidines in the third codon position. These preferences have been incorporated into widely 
available software which can be used to determine coding potential (and frame) of a given 
stretch of DNA. The algorithm-derived information combined with start/stop codon 
inforination can be used to determine proper frame with a high degree of certainty. This, in 
3 5 turn, readily permits cloning of the sequence in the correct reading frame into appropriate 
expression vectors. - 
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The nucleic acid sequences disclosed herein may be joined to a variety of other 
polynucleotide sequences and vectors of interest by means of well established recombinant 
DNA techniques. See J. Sambrook et al., supra. Vectors of interest include cloning vectors, 
such as plasmids, cosmids, phage derivatives, phagemids, as well as sequencing, replication, 
and expression vectors, and the like. In general, such vectors contain an origin of replication 
functional in at least one organism, convenient restriction endonuclease digestion sites, and 
selectable markers appropriate for particular host cells. The vectors can be transferred by a 
variety of means known to those of skill in the art into suitable host cells which then produce 
the desired DNA, RNA or polypeptides. 

Occasionally, sequencing or random reverse transcription errors will mask the presence 
of the appropriate open reading frame or regulatory element. In such cases, it is possible to 
determine the correct reading frame by attempting to express the polypeptide and determining 
the amino acid sequence by standard peptide mapping and sequencing techniques. See, F. M. 
Ausubel, et ah. Current Protocols in Molecular Biology . John Wiley & Sons, New York, NY 
(1989). Additionally, the actual reading frame of a given nucleotide sequence may be 
determined by transfection of host cells with vectors containing all three potential reading 
frames. Only those cells with the nucleotide sequence in the correct reading frame will 
produce a peptide of the predicted length. 

The nucleotide sequences provided herein have been prepared by current, state-of-the- 
art, automated methods and as such may contain unidentified nucleotides. These will not 
present a problem to those skilled in the art who wish to practice the invention. Several 
methods employing standard recombinant techniques, described in J. Sambrook ( supra ) or 
periodic updates thereof, may be used to complete the missing sequence information. The 
same techniques used for obtaining a full length sequence, as described herein, may be used to 
obtain nucleotide sequence. 

Expression of a particular cDNA may be accomplished by subcloning the cDN A into 
an appropriate expression vector and iransfecting this vector into an appropriate expression* 
host. The cloning vector used for the generation of the cDNA library can be used for 
transcribing mRNA of a particular cDNA and contains a promoter for beta-galactosidase, an 
amino-terminal met and the subsequent seven amino acid residues of beta-galactosidase. 
Immediately following these eight residues is an engineered bacteriophage promoter useful for 
artificial priming and transcription and a number of unique restriction sites, including EcoR I, 
for cloning. The vector can be transfected into an appropriate host strain of E. colj. 

Induction of the isolated bacterial strain with isopropylthiogalactosidc (IPTG) using 
standard methods will produce a fusion protein which contains the first seven residues of beta- 
galactosidase, about 15 residues of linker, and the peptide encoded within the cDNA. Since 
cDNA clone inserts are generated by an essentially random process, there is one chance in 
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three that the included cDN A will lie in the correct frame for proper translation. If the'cDN A is 
not in the proper reading frame, the correct frame can be obtained by deletion or insertion of an 
appropriate number of bases by well known methods including in vitro mutagenesis, digestion 
with exonuclease III or mung bean nuclease, or oligonucleotide linker inclusion. . 
5 The cDNA can be shuttled into other vectors known to be useful for expression of 

protein in specific hosts. Oligonucleotide primers containing cloning sites and segments of 
DNA sufficient to hybridize to stretches at both ends of the target cDNA can be synthesized 
chemically by standard methods. These primers can then be used to amplify the desired gene 
segments by PCR. The resulting new gene segments can be digested with appropriate 
1 0 restriction enzymes under standard conditions and isolated by gel electrophoresis. Alternately, 
similar gene segments can be produced by digestion of the cDNA with appropriate restriction 
enzymes and filling in the missinjg gene segments with chemically synthesized 
oligonucleotides. Segments of the coding sequence from more than one gene can be ligated 
together and cloned in appropriate vectors to optimize expression of recombinant sequence. 

1 5 Suitable expression hosts for such chimeric molecules include but are not limited to, 

mammalian cells such as Chinese Hamster Ovary (CHO) and human 293 cells, insect cells 
such as Sf9 cells, yeast cells such as Saccharomvces cerevisiae . and bacteria such as E. coli . . 
For each of these cell systems, a useful expression vector may also include an origin of 
replication to allovy propagation in bacteria and a selectable marker such as the beta-lactamase 

2 0 antibiotic resistance gene to allow selection in bacteria. In addition, the vectors may include a 

second selectable marker such as the neomycin phosphotransferase gene to allow selection in 
: transfected eukaryotic host cells. Vectors for use in eukaryotic expression hosts may require 
the addition of 3' poly A tail if the sequence of interest lacks poly A. 

Additionally, the vector may contain promoters or enhancers which increase gene 

2 5 expression. Such promoters are host specific and include but are not limited to MMTV, 

SV40, or metallothionine promoters for CHO cells; trp, lac, tac or T7 promoters for bacterial 
hosts; or alpha factor, alcohol oxidase or PGH promoters for yeast. Adenoviral vectors with 
or without transcription enhancers, such as the rous sarcoma virus (RSV) enhancer, may be 
used to drive protein expression in mammalian cell lines. Once homogeneous cultures of 

3 0 recombinant cells are obtained, large quantities of recombinantly produced protcin'can be 

recovered from the conditioned medium and analyzed using chromatographic methods well . 
known in the art. An alternative method for the production of large amounts of secreted 
protein involves the transformation of mammalian embryos and the recovery of the 
recombinant protein from milk produced by transgenic cows, goats, sheep, etc. Polypeptides 
3 5 and closely related molecules may be expressed recombinantly in such a way as to facilitate 
protein purification. One approach involves expression of a chimeric protein which includes 
one or more additional polypeptide domains not naturally present on human polypeptides. 
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Such purification-facilitating domains include, but are not limited to, metal-chclating peptides . 
such as histidine-lryptophan domains that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA). The inclusion 
5 of a cleavable linker sequence such as Factor XA or enterokinase from Inviirogen (San Diego, 
CA) between the polypeptide sequence and the purification domain may be useful for 
recovering the polypeptide. 
Immunoassays. 

The polypeptides including their fragments or derivatives or analogs thereof of the 

1 0 present invention, or cells expressing them, can be in a variety of assays, many of which are 

described herein, for the detection of antibodies. They also can be used as an immunogen to 
produce antibodies. These antibodies can be, for example, polyclonal or monoclonal 
antibodies, chimeric, single chain and humanized antibodies, as well as Fab fragments, or the 
product of an Fab expression library. Various procedures known in the an may be used for 
15 the production of such antibodies and fragments. 

For example, antibodies generated against a polypeptide corresponding to a sequence 
of the present invention can be obtained by direct injection of the polypeptide into an animal or 
by administering the polypeptide to an animal such as a mouse, rabbit, goat or human. A 
TTiouse, rabbit or goat is preferred. The antibody so obtained then will bind the polypeptide 

2 0 itself. In this manner, even a sequence encoding only a fragment of the polypeptide can be 

used to generate antibodies that bind the native polypeptide. Such antibodies can then be used 
to isolate the polypeptide from test samples such as tissue suspected of containing that 
polypeptide. For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the 

2 5 hybridoma technique as described by Kohler and Milstein, Nature 256:495-497 ( 1 975), the 

irioma technique, the human B-cell hybridoma technique as described by Kozbor et al, 
Immun. Today 4:72 ( 1 983), and the EB V-hybridoma technique to produce human monoclonal 
antibodies as described by Cole, et al., in Monoclonal Antibodie s and Cancer Therapy. Alan 
R. Liss, Inc, New York. NY, pp. 77-96 (1985). Techniques described for the production of 

3 0 single chain antibodies can be adapted to produce single chain antibodies to immunogenic 

polypeptide products of this invention. Sec, for example, U.S. Pat. No. 4,946,778, which is 
incorporated herein by reference. 

Various assay formats may utilize the antibodies of the present inveir ion, including 
"sandwich" immunoassays and probe assays. For example, the monoclonal antibodies or 
3 5 fragment thereof of the present invention can be employed in various assay systems to 

determine the presence, if-any. of MMP 1 9 polypeptide in a test sample. For example, in a first 
assay format, a polyclonal or monoclonal antibody or fragment thereof, or a combination of 
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ihese antibodies, which has been coated on a solid phase, is contacted with a test sample, to 
form a first mixture. This first mixture is incubated for a time and under conditions sufficient 
to form antigen/antibody complexes. Then, an indicator reagent comprising a monoclonal or a 
polyclonal antibody or a fragment thereof, or a combination of these antibodies, to which a 
signal generating compound has been attached, is contacted with the antigen/antibody 
complexes to form a second mixture. This second mixture then is incubated for a tirne and 
under conditions sufficient to form antibody/antigen/antibody complexes. The presence of a 
MMP19 polypeptide antigen present in the test sample and captured on the solid phase, if any, 
is determined by detecting the measurable signal generated by the signal generating compound. 
The amount of MMP19 polypeptide antigen present in the test sample is proportional to the 
signal generated. 

Or, a polyclonal or monoclonal MMP 1 9 polypeptide antibody or fragment thereof, or a 
combination of these antibodies which is bound to a solid support, the test sarnple and an 
indicator reagent comprising a monoclonal or polyclonal antibody or fragments thereof, which 
specifically binds to MMP 1 9 polypeptide antigen, or a combination of these antibodies to 
which a signal generating compound is attached, are contacted to form a mixture. This mixture 
is incubated for a time and under conditions sufficient to form antibody/antigen/antibody 
complexes. The presence, if any, of MMP 19 polypeptide present in thctest sample and 
captured on the solid phase is determined by detecting the measurable signal generated by the 
signal generating compound. The amount of MMP19 polypeptide proteins pre.sent in the test 
sample is proportional to the signal generated. 

In another assay format, one or a combination of at least two monoclonal antibodies of 
the invention can be employed as a competitive probe for the detection of antibodies to MMP 19 
polypeptide protein. For example, MMP 19 polypeptide proteins such as the recombinant 
antigens disclosed herein, either alone or in combination, are coated on a solid phase. A test 
sample suspected of containing antibody to MMP19 polypeptide antigen then is incubated with 
an indicator reagent comprising a signal generating compound and at least one monoclonal 
antibody of the invention for a time and under conditions sufficient to form antigen/antibody 
complexes of either the test sample and indicator reagent bound to the solid phase or the 
indicator reagent bound to the solid phase. The reduction in binding of the monoclonal 
antibody to the solid phase can be quantitatively measured. 

In yet another detection method, each of the monoclonal or polyclonal antibodies of the 
present invention can be employed in the detection of MMPl 9 polypeptide antigens in fixed 
tissue sections, as well as fixed cells by immunohistochemical analysis. Cyio:_,..;:r.ical 
analysis wherein these antibodies are labeled directly (with, for example, fluorescein, colloidal 
gold, horseradish peroxidase, alkalinephosphata.se, etc.) or are labeled by using secondary 



wo 98/40475 



PCT/US98/04694 



labeled anti-species antibodies (with various labels as exemplified herein) to track the ' 
histopathology of disease also are within the scope of the present invention. 

In addition, these monoclonal antibodies can be bound to matrices similai- to CNBr- 
activated Sepharose and used for the affinity purification of specific MMP19 polypeptide 
5 proteins from cell cultures or biological tissues such as to purify recombinant and native 
MMP19 polypeptide antigens and proteins. 

The monoclonal antibodies of the invention can also be used for the generation of 
chimeric antibodies for therapeutic use, or other similar applications. 

The monoclonal antibodies or fragments thereof can be provided individually to detect 
1 0 MMP19 polypeptide antigens. Combinations of the monoclonal antibodies (and fragments 

thereoO provided herein also may be used together as components in a mixture or "cocktail" of 
at least one MMPl 9 polypeptide antibody of the invention with antibodies to other MMP19 
polypeptide regions, each having different binding specificities. Thus, this cocktail can 
include the monoclonal antibodies of the invention which are directed to MMPl 9 polypeptide 

1 5 proteins and other monoclonal antibodies to other antigenic determinants of the MMPl 9 

polypeptide genome. 

The polyclonal antibody or fragment thereof which can be used in the assay formats 
should specifically bind to a MMPl 9 polypeptide region or other MMPl 9 polypeptide proteins 
used in the assay. The polyclonal antibody used preferably is of mammalian origin; human, 

2 0 goat, rabbit or sheep ariti- MMPl 9 polypeptide polyclonal antibody can be used. Most 

preferably, the polyclonal antibody is rabbit polyclonal anti-MMP19 polypeptide antibody. 
The polyclonal antibodies used in the assays can be used either alone or as a cocktail of 
polyclonal antibodies. Since the cocktails used in the assay formats arc compri.sed of either 
monoclonal antibodies or polyclonal antibodies having different MMPl 9 polypeptide 

2 5 specificity, they would be useful for diagnosis, evaluation and prognosis of MMPl 9 

polypeptide condition, as well as for studying MMP19 polypeptide protein differentiation and 
specificity. 

It is contemplated and within the scope of the present invention that the MMPl 9 
polypeptide may be detectable in assays by use of a recombinant antigen as well as by use of a 

3 0 synthetic peptide or purified peptide, which contains an amino acid sequence of MMPl 9 

polypeptide. It also is within the scope of the present invention that different synthetic, 
recombinant or purified peptides identifying different epitopes of the MMPl 9 polypeptide can 
be used in combination in an assay to diagnose, evaluate, or prognose the disease condition. 
In this case, these peptides can be coated onto one solid phase, or each separate peptide may be 
3 5 coated on separate solid phases, such as microparticles, and then combined to form a mixture 
of peptides which can be later used in assays. Furthermore, it is contemplated that multiple 
peptides which define epitopes from different polypeptides may be used in combination to 
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make a diagnosis, evaluation, or prognosis of disease. Peptides coated on solid phases or 
labelled with detectable lable^ are then allowed to compete with peptides from a patient sample 
for a limited amount of antibody. A reduction in binding of the synthetic, recombinant, or 
purified peptides to the antibody (or antibodies) is an indication of the presence of MMP 1 9 
polypeptides in the patient sample which in turn indicates the presence of disease in the patient. 
Such variations of assay formats are known to those of ordinary skill in the art and are 
discussed herein below. 

In another assay format, the presence of antibody and/or antigen to MMP19 
polypeptide can be detected in a simultaneous assay, as follows. A test sample is 
simultaneously contacted with a capture reagent of a first analyte, wherein said capture reagent 
comprises a first binding member specific for a first analyte attached to a solid phase and a 
capture reagent for a second analyte, wherein said capture reagent comprises a first binding 
member for a second analyte attached to a second solid phase, to thereby form a mixture. This 
mixture is incubated for a time and under conditions sufficient to form capture reagent/first 
analyte and capture reagent/second analyte complexes. These so-formed complexes. then are - 
. contacted with an indicator reagent comprising a member of a binding pair specific for the first 
analyte labeled with a signal generating compound and an indicator reagent comprising a 
member of a binding pair specific for the second analyte labeled with a signal generating 
compound to form a second mixture. This second mixture is incubated for a time and under 
conditions sufficient to form capture reagent/first analyte/indicator reagent complexes and 
capture reagent/second analyte/indicator reagent complexes. The presence of one or more 
analytes is determined by detecting a signal generated in connection with the complexes formed 
on either or both solid phases as an indication of the presence of one or more analytes in the 
test sample. In this assay format, recombinant antigens derived from human expression 
systems may be utilized as well as monoclonal antibodies produced from the proteins derived 
from the mammalian expression systems as disclosed herein. Such assay systems are 
described in greater detail in EP Publication No. 0473065. 

In yet other assay formats, the polypeptides disclosed herein may be utilized to detect 
the presence of anti- MMP19 polypeptide in test samples. For example, a test sample is 
incubated with a solid phase to which at least one recombinant protein has been attached. 
These are reacted for a time and under conditions sufficient to form antigen/antibody 
complexes. Following incubation, the antigen/antibody complex is detected., Indicator 
reagents may be used to facilitate detection, depending upon the assay system chosen. In 
ai:v:her assay format, a test sample is contacted with a .solid phase to which a recombinant 
protein produced as described herein is attached and also is contacted with a monoclonal or 
polyclonal antibody:.specific for the protein, which preferably has been labeled with an 
indicator reagent. After incubation for a time and under conditions sufficient for 
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antibody/antigen complexes to form, the solid phase is separated from the free phase, and the 
• label is detected in either the solid or free phase as an indication of the presence of MMP 1 9 
polypeptide antibody. Other assay formats utilizing the recombinant antigens disclosed herein 
are contemplated. These include contacting a test sample with a solid phase to which at least 
one antigen from a first source has been attached, incubating the solid phase and test sample 
for a time and under conditions sufficient to form antigen/antibody complexes, and then 
contacting the solid phase with a labeled antigen, which antigen is derived from a second 
source different from the first source. For example, a recombinant protein derived from a first 
source such as E^coh is used as a capture antigen on a solid phase, a test sample is added to 
the so-prepared solid phase, and a recombinant protein derived from a different source (i.e., 
non- E. cQli ) is utilized as a part of an indicator reagent. Likewise, combinations of a 
recombinant antigen on a solid phase and synthetic peptide in the indicator phase also are ' 
possible. Any assay format which utilizes an antigen specific for MMP 19 polypeptide from a 
first source as the capture antigen and an antigen specific for MMP 19 polypeptide from a 
different second source are contemplated. Thus, various combinations of recombinant 
antigens, as well as the use of synthetic peptides, purified proteins, and the like, are within the 
scope of this invention. Assays such as this and others are described in U.S. Patent No. 
5,254,458, which enjoys common ownership and is incorporated herein by reference. 

Other embodiments which utilize various other solid phases also are contemplated and 
are within the scope of this invention. For example, ion capture procedures for immobilizing 
an immobilizable reaction complex with a negatively charged polymer (described in HP 
publication 0326100 and EP publication No. 0406473), can be employed according to the 
present invention to effect a fast solution-phase immunochemical reaction. An immobilizable 
immune complex is separated from the rest of the reaction mixture by ionic interactions 
between the negatively charged poly-anion/immune complex and the previously treated, 
positively charged porous matrix and delected by using various signal generating systems 
previously described, including those described in chcmilumincsccnt signal measurements as 
described in EPO Publication No. 0 273,1 15. 

Also, the methods of the present invention can be adapted for use in systems which 
utilize microparticle technology including in automated and semi-automated systems wherein 
the solid phase comprises a microparticle (magnetic or non-magnetic). Such systems include 
those described in published EPO applications Nos. EP 0 425 633 and EP 0 424 634, 
respectively. 

The use of scanning probe microscopy (SPM) for immunoassays also is a technology 
to which the monoclonal antibodies of the present invention are easily adaptable. In scanning 
probe microscopy, in particular in atomic force microscopy, the capture phase, for example, at 
least one of the monoclonal antibodies of the invention, is adhered to a solid phase and a 
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scanning probe microscope is utilized to detect antigen/antibody complexes which may be 
present on the surface of the solid phase. The use of scanning tunneling microscopy 
eliminates the need for labels which normally must be utilized in many immunoassay systems 
to detect antigen/antibody complexes. The use of SPM to monitor specific binding reactions 
5 can'occur in many ways. In one embodiment, one member of a specific binding partner 

(analyte specific substance which is the monoclonal antibody of the invention) is, attached to a 
surface suitable for scanning. The attachment of the analyte specific substance may be by 
adsorption to a test piece which comprises a solid phase of a plastic or metal surface, following 
methods known to those of ordinary skill in the art. Or, covalent attachment of a specific 

1 0 binding partner (analyte specific substance) to a test piece which test piece comprises a solid 

phase of derivatized plastic, metal, silicon, or glass may be utilized. Covalent attachment 
methods are known to those skilled in the art and include a variety of means to irreversibly link 
specific binding partners to the test piece. If the lest piece is silicon or glass, the surface must 
be activated prior to attaching the specific binding partner. Also, polyelectrolyte interactions 
15' may be used to immobilize a specific binding partner on a surface of a test piece by using 
techniques and chemistries. The preferred method of attachment is by covalent means. 
Following attachment of a specific binding member, the surface may be further treated with 
materials such as serum, proteins, or other blocking agents to minimize non-specific binding. 
The surface also may be scanned either at the site of manufacture or point of use to verify its 

2 0 suitability for assay purpo.ses. The scanning process is not anticipated to alter the specific 

binding properties of the test piece. 

While the present invention discloses the preference for the use of solid phases, it is 
contemplated that the reagents such as antibodies, proteins and peptides of the present 
invention can be utilized in non-solid phase assay systems. These assay systems are known to 

2 5 those skilled in the art, and are considered to be within the scope of the present invention. 

It is contemplated that the reagent employed for the assay can be provided in the form 
of a test kit with one or more containers such as vials or bottles, with each container containing 
a separate reagent such as a probe, primer, monoclonal antibody or a cocktail of monoclonal 
antibodies, or a polypeptide (either recombinant or synthetic) employed in the assay. Other 

3 0 components such as buffers, controls, and the like, known to those of ordinary skill in art, 

may be included in such test kits. It also is contemplated to provide test kits which have means 
for collecting test samples cornprising accessible body fluids, eg. blood, urine, saliva, and 
stool. Such collection means include lancets and absorbent paper or cloth for cui:.\:ring and 
stabilizing blood; swabs for collecting and stabilizing saliva; cups for collecting and stabilizing 
3 5 urine or stool samples. Collection materials, papers, cloths, swabs, cups and the like, may 
' optionally be treated to avoid denaturation or irreversible adsorption of the sample. The 
collection materials also may be treated with or contain preservatives, stabilizers or 



wo 98/40475 



PCT/US98/04694 



45 

antimicrobial agents to help maintain the integrity of the specimens. Test kits designed for the 
collection, stabilization, and preservation of test specimens obtained by surgery or needle 
biopsy are also useful. It is contemplated that all kits may be configured in two components; 
one component for collection and transport of the specimen, and the other component for the ^ 
analysis of the specimen. Further, kits for the collection, stabilization, and preservation of test 
specimens may be configured for use by untrained personnel and may be available in the open 
market for use at home with subsequent transportation to a laboratory for analysis of the test 
sample. 

E. coli bacteria (clone #907334) has been deposited at the American Type Culture 
Collection (A.T.C.C.), 12301 Parklawn Drive, Rockville, Maryland 20852, as of March 10, 
1997, under the terms of the Budapest Treaty and will be maintained for a period of thirty (30) 
years from the date of deposit, or for five (5) years after the last request for the deposit, or for 
the enforceable period of the U.S. patent, whichever is longer. The deposit and any other 
deposited material described herein are provided for convenience only, and are not required to 
practice the present invention in view of the teachings provided herein. The cDNA sequence in 
all of the deposited material is incorporated herein by reference. Clone #907334 was accorded 
- A.T.C.C. Deposit No . 

Having now generally described the invention, a complete understanding can be 
obtained by reference to the following specific examples. The following examples are given 
for th-- purpose of illusu-ating various embodiments of the invention and are not meant to limit 
the present invention in any fashion. 

Example 1: Isolation of Human cDNA Clones Homologous to MMPs 

A consensus of twelve human matrix metalloprotease amino acid sequences was 
obtained by aligning sequences using the PILEUP program in the Wisconsin Sequence 
Analysis Package (Genetics Computer Group [GCG], Madison, WI). The consensus 
sequence was derived from a plurality of seven sequences at a given amino acid position. The 
letter *'x" was .substituted for each amino acid that differed from the determined consensus 
residue and was used to represent any amino acid. 

The MMP consensus sequence (SEQ ID N0:7, sec FIG. 1) was used to search the 
LifeSeq^w human expression database (Incyle Pharmaceuticals, Inc., Palo Alto, CA) for 
human MMP sequences. This database is comprised of partial sequences of cDNA clone 
inserts, so-called expressed sequence tags (EST.s), derived jrom cDNA libraries from various 
human tissues. A search of LIFESEQ^*^ using the BLAST and Smith-Waterman similarity 
search algorithms with the MMP consensus sequence identified an EST (from clone #907334) 
unique to the LIFESEQ™ database and whose deduced amino acid sequence of one reading 
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frame contained the amino acid sequence PRCGVTD (SEQ ID N0:8), similar to the "cysteine 
switch" motif in the propeptide region of all MMPs. The EST from clone #907334 was 
derived from a colon tissue library using oligo-dT for the reverse transcription reaction. The 
cDNA clone #907334 in pSPORTI plasmid (GIBCO BRL, Gaithersburg, MD) was obtained 
5 for further study. 

Clone #907334 was amplified in bacteria and digested with the restriction enzymes 
EcoRI and.Notl to determine the size of the cDNA insert. Insert length was determined to be 
approximately 1900 base pairs. The complete nucleotide sequence of the clone (shown in 
HG. 4) was obtained by initially sequencing with plasmid primers and subsequently with 
I 0 primers near the end of each newly obtained sequence. All sequence information from each 
reaction was compiled and analyzed using the Sequencher'^'^. program (Gene Codes 
Corporation, Ann Arbor, MI). Sequencing was. continued until both strands of the cloned 
cDNA insert were sequenced in entirely. In FIG. 4, the sense strand (SEQ ID NO:l) is shown 
on top with its complement (SEQ ID NQ:2) aligned dircctly.beneath. 

1 5 ■• The deduced amino acid sequence of clone #907334 (SEQ ID NO: 1 0, also shown in ■ 

FIG. 4) encoded a polypeptide of at least 470 amino acids but lacked an initialing methionine . 
start codon and a portion of the putative pro-peptide domain. The amino acid sequence was 
compared to those of other MMPs (FIG. 5) and shown to contain a cysteine switch motif 
(SEQ ID NO:8) within the putative pro-peptide domain which was 86% homologous to the 

2 0 consensus cysteine switch motif of the other MMPs (PRCG VPD, SEQ ID NO: 11). In 

addition, MMP 19 contained an identical version (100%) of the highly conserved zinc binding 
consensus sequence (HEXaaGHXaaLGLXaaHS. SEQ ID NO: 12) found in the putative 
catalytic domain. The sequence also contained a potential recognition and cleavage site 
(RXKR, SEQ ID NO:33) for Kex2-like proteases shown to be involved in enzyme activation 

2 5 of stromelysin 3 and the membrane-type MMPs. 

Example 2: Localization of Clone #907334 Transcripts 

A. Nothem Blotting : The entire nucleotide sequence of clone #907334 (hereafter 

3 0 referred to as MMP19) was compared to LIFESEQ'''*^ and other databases (for example 

GenBank, Wash. U./Merck EST) using the BLAST and Smith-Waterman programs. Public 
databases did not contain sequences identical to MMP 19 cDNA. LIFESEQ*'''^ contained eight 
additional ESTs identical to some portion of MMP 19 cDNA (FIG. 6). The nine ESTs in 
LIFESEQ"^*^ which were identical to MMPi9 DNA had been sequenced from cDNA libraries 
3 5' derived from 8 different tissues, including colon, lung, prostate, spinal cord, ovary and 

prostate tumor, brain tumor and ganglioneuroma (FIG. 6) Thus, these tissues expressed at 
least small amounts of the mRNA encoded by MMPl 9 cDN A. 
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In order to determine whether MMP19 mRNA sequences could be detected by 

32 

hybridization, the entire 1717 bp fragment of MMP19 cDNA was radiolabelled with P using 

a commercial random primer labeling kit (Pharmacia, Piscataway, NJ). Specific activity of the 

8 

labeled fragment was determined to be -5 x 10 cpm/|ig DNA. The labeled cDNA was used 
as a probe to hybridize to mRNA (2 \ig polyA+ RNA) on Northern blots (specifically, 
Multiple Tissue Northerns 1 and 2, obtained from Clontech, Palo Alto, CA) which contained 
various human tissue RNA. The blots were prehybridized at 60°C for 1 hour in Express Hyb 
solution (supplied with the blots) and hybridized (also in Express Hyb solution) at the same 
temperature for two hours in the presence of denatured probe at i x 1 0^ cpm/mL. After 
washing the blots once in 2X SSPE + 0.1% SDS (20 min), and twice under stringent 
conditions (0.2X SSPE + 0.1% SDS, 50°C, 20 min. each wash), the filters were exposed and 
analyzed on a Phosphorimager (Molecular Dynamics, Sunnyvale, CA). By Northern blotting, 
it was determined that MMP19 mRNA transcript is approximately 2.4 kb and is present in 
human testes and to a lesser extent in human colon, ovary, small intestine, and prostate tissue 
(FIG. 11). Smaller less abundant transcripts with similar tissue distribution were also 
observed. 

B. Ribonuclease Protection Assay : Alternatively, instead of or in addition to 
performing a Northern blot, a ribonuclease protection assay may be performed lo deiermine 
whether MMP19 is present in particular tissues. A ribonuclease protection assay is performed 
as follows: 

]^ Labeling of Complementarv RNA (cRNA) Hvbridiz ation Probes. Labeled 
sense and antisense riboprobes are transcribed from the EST sequence which contains an RNA 
polymerase promoter such as SP6 or T7. The sequence may be from a vector containing the 
appropriate EST insert or from a PCR-generated product of the insert using PGR primers - 
which incorporate am RNA polymerase promoter sequence. The transcripts are prepared in a 
20 jiL reaction volume containing 1 ^g of DNA template, 2 of 100 mM dithiothreitol, 0.8 
^iL of RNasin (10-40U), 500 ^M each of ATP, CTP, GTP. 5 jiL (alpha^^P) UTP or 100-500 
[iM biotinylated UTP, and 1 ^iL of RNA polymerase in transcription buffer (40 mM Tris-HCI, 
pH 7.5, 6 mM MgCli, 2mM spermidine HCl, 5 mM NaCl). Following incubation at 37ooC 
for one hour, the transcripts are treated with DNase I (15 U) for an additional 30 min to digest 
the template. The probes then are isolated by spin columns, salt precipitation or 
electrophoresis techniques which are well-known in the art. Finally, the probes are dissolved 
in lysis bufk (5 M Guanidine Thiocyanate, 0. 1 M EDTA, pH 7.0). 

2. Hybridization of Labeled Pro be to Target. Approximately 20 ^lg of 
extracted total cellular RNA, prepared as described in Sambrook, et al supra, is placed in 10 
^iL of lysis buffer and mixed with either (i) 1x10^ cpm of radioactively labeled probe or (ii) 
250 pg of non-isoiopically labeled probe, each in 2 M.L of lysis buffer. The mixture then is 



wo 98/40475 



PCT/US98/04694 



48 

incubated at 60°oC for 5 min and hybridized overnight at room temperature. See; T. kaabache 
et ai , Anal. Biochem, 232: 225-230 ( 1 995). 

3. RNase Digestion . Hybridizations are terminated by incubation with 380.)iL 
of a solution containing 40 |ig/mL RNase A and 625 U/mL RNase Tl in 1 mM EDTA, 300 

5 mM NaCl, 30 mM Tris-HCl pH 7.4 for 45-60 min at room temperature. RNase digestion then 
is terminated by the addition of 60 \iL of proteinase-K (1 .7 mg/mL) containing 3.3% SDS, 
followed by incubation for 30 min at 37°C. The digested mixture then is extracted with r 
phenol:ch]oroform:isoamyl alcohol to remove protein. The mRNAxRNA hybrids are 
precipitated from the aqueous phase by the addition 4 |ig yeast tRNA and 800 |iL of ethanol, 
1 0 and incubation at -80°C for 30 min. The precipitates are coilected.by centrifugation. 

4. Fragment Analysis ; The precipitates are dissolved in 5 ^.L of denaturing gel 
loading dye (80% formamide, 10 mM EDTA, pH 8.0, 1 mg/mL xylene cyanol, 1 mg/mL 
bromophenol blue) and elecirophoresed in 6 % polyacrylamide TBE, 8 M urea denaturing 
gels. The gels are dried under vacuum and autoradiographcd. Quantitation can be. performed 

1 5 by comparing the counts obtained from the test samples to a calibration curve that was 

generated by utilizing calibrators that are the sense strand. In cases where non-isotopic labels 
are used, hybrids are transferred from the gels to membranes (nylon or nitrocellulose) by 
blotting and then analyzed using detection systems that employ streptavidin alkaline 
phosphatase conjugates and chemiluminesence or chemifluoresence reagents. Again, 

2 0 expression of an mRNA which is detectable by the labeled probe in a particular tissue suggests 

that MMPI9 is produced in that tissue. 

C. Dot Blot/Slot Blot : Dot and slot blot assays are also quick methods to evaluate the 
presence of a specific nucleic acid sequence in a complex mix of nucleic acid. To perform, up 
to 20 ^g of RNA is mixed in 50 |iL of 50% formamide, 7% formaldehyde, IX SSC, incubate 

2 5 15 min at 68°C and cool on ice. Then, 100 ^iL of 20X SSC is added to the RNA mixture and 

loaded under vacuum onto a manifold apparatus that has a prepared nitrocellulose or nylon 
membrane. The membrane is soaked in water, 20X SSC for 1 hour, placed on two sheets of 
20X SSC prewet Whatman #3. filter paper, and loaded into a slot blot or dot blot vacuum 
manifold apparatus. The slot blot is analyzed with probes prepared and labeled as described 

3 0 supra. Other methods and buffers not specifically detailed here are described in J. Sambrook 

et ai, supra. 

D. In Situ Hvbridization : This method is useful to directly detect specific target 
nucleic acid sequences in cells using detectable nucleic acid hybridization probes. * 

Tissues are prepared with cross-linking fixatives agents such as paraformaldehyde or 
3 5 glutaraldehyde for maximum cellular RNA retention. See, L. Angerer et ai. Methods in Cell 
BioL 35: 37-71 (1991). Briefly, the tissue is placed in greater than 5 volumes of 1% 
glutaraldehyde in 50 mM sodium phosphate, pH 7.5 at 4°C for 30 min. The solution is 
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changed with fresh solution for a further 30 niin fixing. The fixing solution should have an 
osmolality of approximately 0.375% NaCl. The tissue is washed once in isotonic NaCl to 
remove the phosphate. 

The fixed tissues then are embedded in paraffin, as follows. The tissue is dehydrated 
through a series of ethanol concentrations for 15 min each: 50% twice, 70% twice, 85%, 90% 
and 100% twice. The tissue next is soaked in two changes of xylene for 20 min each at room 
temperature; then it is soaked in two changes of 1 xylene: I paraffin for 20 min each at 60°C; 
and then it is soaked in three final changes in paraffin for 15 min each. 

The tissue next is cut in 5 |im sections using a standard microtome and placed on a 
slide previously treated with the tissue adhesive 3-aminopropyliriethoxysilane. 

Paraffin is removed from the tissue by two 10 min xylene soaks and rehydrated in a 
series of ethanol concentrations; 99% twice, 95%, 85%, 70%, 50%. 30% and distilled water 
twice. The sections are pre-ireaied with 0.2 M HCl for 10 min and permeabiiized with 2 
|ig/mL Proteinase-K at 3TC for 1 5 min. 

* Labeled riboprobes transcribed from the pSPORTl plasmid containing fragments of 
* MMP19 cDNA are hybridized to the prepared tissue sections and hybridized overnight at 
56ooC in 3X standard saline extract and 50% formamide. Excess probe is removed by 
washing in 2X standard saline citrate and 50% formamide followed by digestion with 100 
^ig/mL RNase A at 37ooC for 30 min. Fluorescence probe is visualized-by illumination with 
UV light under a microscoper Fluorescence in the cytoplasm is indicative- of mRNA 
production. Fluorescence in the nucleus detects the presence of genomic material. 
Alternatively, the sections can be visualized by autoradiography. 

Pxample 3: Hxpression o f MMPIQ Sequences 

A. ronstmction of Fxpre.^ssion V e rtnrQ rnntaininP QNA Fragmftnys F.ncodinR the 
rmnlvtic Domains nf MMP19 Protein: The MMP19 plasmid was used as template in PGR 
reactions to generate DNA fragments encoding MMP19 protein for introduction into two 
cukaryotic expression vectors, pcDNA3.l (Invitrogen, San Diego, CA) and pCINeo 
(Promega, Madison, WI). The two upstream primers used in PGR reactions, SEQ ID N0:3 
(5'-GTATCTGTAGAGAGGATGTTTGGAAAGGAAGGTAAGAAATGGTAGAAGC-3^)and 
SEQ ID N0:4 (S'.GTATCTCTAGACACCATGAAGGTTCTGTGGGCTGCGTTGCTGGT- 
CACATTCCrGGCAGGATGCCAGGCCTTTGCAAAGCAAGGTAACAAATGG-3') 
contained sequences to create an Xbal restriction enzyme site, a Kozak consensus sequence for 
translation initiation, an ATG start codon, and a sequence of nucleotides from nucleotide 
position 2 1 4 to nucleotide position 244 of SEQ ID NO: I , which coTresponded to the N- 
terminus of the putative catalytic domain of the enzyme. SEQ ID N0:4 also contained a 
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sequence which encodes a signal peptide (from the secreted protein, apolipoprotein E), for 
transport of the translated protein into the secretory pathway. The signal peptide sequence was 
placed upstream from and in frame with the sequence encoding the-N-tenminus of the putative 
catalytic domain. Downstream primers used in PGR reactions were either SEQ ID N0:5 (5*- 
GTACITCTAGACTACTTGTCAT.CGTCGTCCTTGTAGTCACCACCGAACAGGGCG- 
CTCCCCGAGTTGGCATGCC-3') which encodes an epitope recognized by anti-FLAG M2, 
(available from Sigma Biosciences, St. Louis. MO) or SEQ ID N0:6 (5*-GTACTTCTAGA- 
GATCTTCTTCACTGATCAGCTTCTGTTCACCACCGAACAGGGCGCTCCCCGAGTTG 
-GCATGCC-3') which encodes an epitope recognized by anti-c-Myc (Santa Cruz 
Biotechnology, Santa Cruz, CA). In both downstream primers, these epitope sequences were 
in frame with the last codon in the open reading frame of the cDNA and were followed by a 
stop codon (TGA) and an Xbal restriction enzyme site. 

PGR amplifications were performed using primers sets SEQ ID N0S:3 and 5, SEQ ID 
N0S:3 and 6, SEQ ID N0S:4 and 5 and SEQ ID NOS:4 and 6 under standard PGR 
conditions, i.e. in a total reaction volume of 50 )xL containing 200 p.M of each dNTP wherein 
N was A, T, G and C, 1 of each primer, -50 ng template DNA and 1 unit of AmpliTaq 
DNA polymerase (Perkin Elmer, Norwalk, CT). Amplifications were performed for a total of 
35 cycles (1 cycle = 94°G for 1 min, 60°C for 1 min, and 72^C for 2 min). After 
amplification, PGR products were digested with Xbal, gel purified, and ligated into the Xbal 
site of pGDNA3. 1 and pGINeo. After transformation of the ligated DNA into appropriate 
bacterial hosts (for example DH5a), plasmid DNA was prepared from individual clones and 
subjected to restriction enzyme and sequence analysis to identify clones that contained MMP19 
DNA with the correct sequence and in the proper orientation. 

B. Transfection of MMP19DNA Expression Vectors into HEK293 Cells: Expression 
of MMP19 constructs in HEK293 cells is achieved by transfection using a modified calcium- 
phosphate procedure (Chen and Okayama, Mol Cell Biol 7: 2745-2752, 1987). HEK-293 
cells (ATGG GRL1573) are grown in minimal essential medium (MEM) containing 10% fetal 
bovine serum (FBS) and antibiotic/antimycotic solution at 37° C in 5% GO2, 95% O2. Cells 
are transfected with purified plasmid MMP19 DNA (20 M.g/2.5 X 10^ cells/10 cm plate). For 
stable transfections, cells are grown in the presence of G4I8 and MEM/FBS. For both 
transient and stable transfections, serum-free media is harvested 3 days after medium 
replacement: 

C. Dete: (ion of MMP 19 protein fWestern blot): Media from transfected cells is mixed 
(volivol, 1:1) with 2X denaturing buffer ( 10 mM Tris-HCl, pH6.8, 4% SDS, 20% glycerol, 
1% 6-mercapioethanol, 0.02% bromphenol blue) and denatured by heating 5 minutes, 95°G. 
Aliquots (20|il) of each sample are electrophorescd for -1 hour on a 10-20% Tris-tricine 
minigel ( 10 X 10 cm) at -50 mA constant current. Gels are transferred to PVDF membranes 



wo 98/40475 



PCT/US98/04694 



by electroblotting at 200 mA for 1.5 hours. Non-specific binding sites on PVDF membranes 
are blocked for 1 hour at room temperature with 5% nonfat dry milk in Tris-buffered saline 
containing Tween-20 (TBS-Tween).(20 mM Tris, pH 7.6, 140 mM NaCl, 0.1% Tween-20). 
Blots are then incubated for 1 hour at room temperature in TBS-Twecn/5% milk containing an 
5 appropriate amount of primary antibody (either anti-FLAG or anti-c-Myc). Blots are washed 
in 3 changes of TBS-Tween for a lota! of 45 minutes at room temperature before incubating ' 
with the detection antibody (1:1000 dilution) for 1 hour at room temperature in TBS- 
Tween/5% milk. Blots are again washed in 3 changes of TBS-Tween for 45 minutes at room 
temperature. Protein bands are visualized by autoradiography using ECL Western blotting 
1 0 detection reagents (Amersham, Arlington Heights, IL). 

Example 4: Assays of Enzvmatic Activity of MMP19 

Purification of the MMP19 protein containing the FLAG sequence is performed by 

1 5 immunoaffinity chromatography using an affinity matrix comprising anti-FLAG M2 

monoclonal antibody covalently attached to agarose by hydrazide linkage (Eastman Kodak 
Co,, New Haven, CT). Prior to affinity purification, medium from transfected HEK293 cells 
is exchanged into 50 mM Tris-HCl pH 7.5, 1 50 mM NaCl buffer using a Sephadex G-25 
(Pharmacia Biotech Inc., Uppsala, Sweden) column. Protein in this buffer is applied to the 

2 0 anti-FLAG M2 antibody affinity column, non-binding protein is eluted by washing the column 

with 50 mM Tris-HCl pH 7.5, 150 mM NaCl buffer, and bound protein is eluted using an 
excess of FLAG peptide in 50 mM Tris-HCl, pH 7.5, 150 mM NaCI. The excess FLAG 
peptide is removed by size exclusion chromatography and column fractions containing 
MMP19 protein are dialyzed in 50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 10 MM CaCI. 

2 5 - Assays to determine enzyme activity are performed using a fluorogenic peptide 

substrate Gly-Glu(EDANS)-Gly-Pro-Uu-Gly-Leu-Tyr-Ala-Ly.s(DABCYL)-GIy (SEQ ID 
NO: 1 3). SEQ ID NO: 1 3 was synthesized and purified acording the the procedure of E.D. 
Matayoshi et al. Science, 247: 954-958, 1990). Hydrolysis of the Gly-Leu bond'in SEQ ID 
NO: 13 results in a 40-fold increase in fluorescence. 

3 0 Purified MMP19 protein from transfected HEK293 cell medium is serially diluted and 

incubated in microliter plate wells with 100 jiM fluorogenic peptide substrate, 50 mM Tris- 
HCl. pH 7.5, 150 mM NaCl in a final volume of 150 ^L. Assay mixtures are incubated at 
room temperature and the progress of the reaction monitored for up to I hour in an Titcrtek 
Fluoroskan II instrument (ICN Biomedicals, Huntsville, AL) with an excitation filter set at 335 
3 5 nm and emission filter at 485 nm. Background fluorescence is determined using identical 

reactions in the absence of MMP19. Data is collected online with a Macintosh computer using 
DELTA SOFT IL version 4.0 (BioMetallics, Inc., Princeton, NJ). Nonlinear curve fitting is 
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performed using KaleidaGraph (Synergy Software, Reading, PA). Compounds which 
potentially inhibit MMP 19 activity are screened by including serial dilutions.of each compound 
with the above reaction and comparing the MMP 19 enzymatic activity in the presence and 
absence of compound. 

Example 5: Proteolysis of pro-MMPs bv MMP 19 

Interaction of MMP 19 with other MMPs is determined by incubation of purified active 
MMP19 with the zymogen (pro*peptide) forms of Gelatinase-A (72 kD) [MMP-2] and 
Gelatinase-B (92 kD) [MMP-9]. Geiatinase-A and Gelalinase-B are isolated from HT- 1080 
cell cultures stimulated with TNFa as described in J. Antibiotics 45: 1733-1737, 1992 and 
Biochem. i. 285: 603-61 1, 1992. After 1 hour incubation at 37°C in buffer containing 50 mM 
Tris-HCl, pH 7.5, 150 mM NaCl and approximately 250 ng of each enzyme, aliquots are 
removed and electrophoresed on 10-20% Tris-Tricine gels. Western blots of gels are 
performed using antibodies to Gelatinse A and Gelatinase B (Oncogene Science, Cambridge, 
MA) as described above and proteolysis is indicated by the presence of bands of lower 
molecular weight relative to the pro-enzyme forms of Gelatinase A and Gelatinase B. 

Example 6: Production of Polvclonal Antibodies to MMP19 

Synthetic peptides (SEQUENCE ID NOS:! 4- 1 7, see Table 1 below) are prepared 
based upon the predicted amino acid sequence of the MMP 19 polypeptide. 



Tabic 1 




Sequence 


SEQ ID NO: 


RHRTKMRRKKRFAKQGN 


14 


FQGDHNDGLGNAFDG 


15 


RSLQDWGGIPEEVSGALPRPDGSIl 


16 


ATELPWMGCWHANSGSALF 


17 



Peptides are synthesized on an ABI Peptide Synthesizer (Applied Biosystems, Foster 
City, CA), using standard reagents and conditions known in the art for solid phase peptide 
synthesis (see for example, Stewart, J.M., and Young, DJ., Solid Phase Peptide Synthesis, 
W.H. Freeman Co., San Francisco, 1963). Cleavage of the peptide from the resin and final 
deprotection of the peptide are achieved by adding the resin to 20 ml trifluoroacetic acid (TFA), 
0.3 ml water, 0.2 ml ethanedithiol, 0.2 ml thioanisole and 100 mg phenol, and stirring at room 
temperature for 1.5 hours. The resin then is filtered by suction and the peptide obtained by 
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precipitation of the TFA solution with ether, followed by filtration. Each peptide is purified 
via reverse-phase preparative HPLC using a water/acetonitrile/0. 1 %TFA gradient and 
lyophilized. The product is confirmed by mass spectrometry. 

To generate antigens for immunization, the purified peptides arc conjugated to Keyhole 
5 Limpet Hemocyanin (KLH) and bovine serum albumin (BS A) using an Imject Activated 

Immunogen Conjugation Kit (Pierce, Rockford, IL) in accordance with the manufacturer^ 
instructions. 

Polyclonal antisera are generated using the protocol of the Berkeley Antibody 
Company (Berkeley, CA). Before receiving the first immunization, a sample of preimmune 
1 0 blood (5 ml) is drawn from each of at least 2 rabbits. Afterward, each rabbit is injected 
subcutaneously with an aliquot of KLH-conjugated peptide (200-500 mg) in Complete 
Freunds Adjuvant. After 21 days, the immune response is boosted with a second 
subcutaneous injection of KLH-conjugated peptide (100-250 mg) in Incomplete Freund^s 
Adjuvant. Blood (50 ml) is collected on day 3 1 and serum tested for reactivity to BSA- 

1 5 coupled peptide using an enzyme linked immunoadsorbani assay (ELISA). Subsequent boosts 

with KLH-conjugated peptide are given on days 42, 63 and 84 (post injection #1) and 
production bleeds (50 ml) drawn on days 52, 73 and 94 for testing by ELIS A to determine 
antibody titer. Serum is then stored at -20°C until further use. 

2 0 Rxamplc 7: Isolation of the 5^ end of MMP19 cDNA 

CapFinder cDNA synthesis kit (Clontech, Palo Alto, CA) was used to amplify DN A at 
the 5' end of MMP19 cDNA. Briefly, reverse transcription reactions using human testes 
mRNA (Clontech, Palo Alto, CA) were performed as specified by the manufacturer. The 

2 5 cDNA was amplified by PCR using an oligonucleotide primer complementary to MMP19 

cDNA (nucleotides 704-678 of SEQ ID N0:1) and the 5' CapSwitch oligonucleotide. PCR 
amplifications were performed in a total reaction volume of 100 nL containing 200 of each 
dNTP wherein N was A, T, G and C, 1 ^iM 6f each primer, 20 ^iL of cDN A from the reverse 
transcription reaction, and 1 unit of AmpliTaq DNA polymerase (Perk in Elmer, Norwalk, 

3 0 CT). Amplifications were performed for a total of 35 cycles ( 1 cycle = 94°C for 1 min, 66^C 

for 1 min, and 72X for 2 min). Two PCR reaction products ( 1 .3 kb and 1 .0 kb) identified by 
agarose gel electrophoresis and elhidium bromide staining were large enough to contain 
sequences upstream of the 5' end of the original sequence. These fragments were purified and 
subcloned into pCR-Script (Stratagene, La Jolla, CA) in accordance with the protocol provided 

3 5 by the supplier. 

Clones are analyzed for sequences encoding the putative 5^ end of MMP19 mRNA 
using an oligonucleotide complementary to MMPl 9 cDN A (nucleotides 704-678 of SEQ ID 
NO: 1 ) in standard nucleotide sequencing reactions. Clones are verified as authentic MN4P19 



wo 98/40475 



PCT/US98/04694 



54 

cDNA by PGR amplification of predicted size bands using testes mRNA as template and 
primers within both the new sequence and complementary to the original jMMP19 cDNA 
sequence (nucleotides 655-626 of SEQ ID NO:l). 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Falduto, Michael 

Magnuson, Scott R. 
Morgan, Douglas w. Morgan 

(ii, TITLE OF THE INVENTION: HUMAN MATRIX METALLOPROTEASE . 
PROTEINS ENCODED THEREFROM AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 

(C) CITY: Abbott Park 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(01 SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: ll-MAR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Casuto, Dianne 

(B) REGISTRATION NUMBER: P-40,943 

(C) REFERENCE/DOCKET NUMBER: 6073. US. 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (847) 93B-3137 

(B) TELEFAX: (847) 938-2623 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 
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U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: ' nucleic acid 

(C) STRANDEDNESS : single 

(D) . TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



AAAGCTCCCA CCTCCACTCG ATTCAGCGAT GCCATCAGAG CGTTTCAGTG GGTGTCCCAG 60 

CTACCTGTCA GCGGCGTGTT GGACCGCGCC AACCTGCGCC AGATGACTCG TCCCCGCTGC 120 

GGGGTTACAG ATACCAACAG TTATGCGGCC TGGGCTGAGA GGATCAGTGA CrTGTTTGCT 180 

AGACACCGGA CCAAAATGAG GCGTAAGAAA CGCTTTGCAA AGCAAGGTAA CAAATGGTAC 24 0 

AAGCAGCACC TCTCCTACCG CCTGGTGAAC TGGCCTGAGC ATCTGCCGGA GCCGGCAGTT 3 00 

CGGGGCGCCG TGCGCGCCGC CTTCCAGTTG TGGAGCAACG TCTCAGCGCT GGAGTTCTGG 3 60 

GAGGCCCCAG CCACAGGCCC CGCTGACATC CGGCTCACCT TCTTCCAAGG GGACCACAAC 42 0 

GATGCGCTGG GCAATGCCTT TGATGGCCCA GGGGGCGCCC TGGCGCACGC CTTCCTGCCC 4 80 
CGCCGCGGCG AAGCGCACTT CGACCAAGAT GAGCGCTGGT CCCTGAGCCG CCGCCGCGGG " 54 0 

CGCAACCTGT TCGTGGTGCT GGCGCACGAG ATCGGTCACA CGCTTGGCCT CACCCACTCG 600, 

CCCGCGCCGC GCGCGCTCAT GGCGCCCTAC TACAAGAGGC TGGGCCGCGA CGCGCTGCTC 660 

AGCTGGGACG ACGTGCTGGC CGTGCAGAGC CTGTATGGGA AGCCCCTAGG GGGCTCAGTG 72 0 

GCCGTCCAGC TCCCAGGAAA GCTGTTCACT GACTTTGAGA CCTGGGACTC CTACAGCCCC 780 

CAAGGAAGGC GCCCTGAAAC GCAGGGCCCT AAATACTGCC ACTCTTCCTT CGATGCCATC 84 0 

ACTGTAGACA GGCAACAGCA ACTGTACATT TTTAAAGGGA GCCATTTCTG GGAGGTGGCA 900 

GCTGATGGCA ACGTCTCAGA GCCCCGTCCA CTGCAGGAAA GATGGGTCGG GCTGCCCCCC 960 

AACATTGAGG CTGCGGCAGT GTCATTGAAT GATGGAGATT TCTACTTCTT CAAAGGGGGT 1020 

CGATGCTGGA GGTTCCGGGG CCCCAAGCCA GTGTGGGGTC TCCCACAGCT GTGCCGGGCA 1080 

GGGGGCCTGC CCCGCCATCC TGACGCCGCC CTCTTCTTCC CTCCTCTGCG CCGCCTCATC 114 0 

CTCTTCAAGG GTGCCCGCTA CTACGTGCTG GCCCGAGGGG GACTGCAAGT GGAGCCCTAC 12 00 

TACCCCCGAA GTCTGCAGGA CTGGGGAGGC ATCCCTGAGG AGGTCAGCGG CGCCCTGCCG 126 0 

AGGCCCGATG GCTCCATCAT CTTCTTCCCA GATGACCGCT ACrCGCGCCT CGACCAGGCC 13 20 

AAACTGCAGG CAACCACCTC GGGCCGCTGG GCCACCGAGC TGCCCTGGAT GGGCTGCTGG 13 80 

CATGCCAACT CGGGGAGCGC CCTGTTCTGA AGGCACCTCC TCACCTCAGA AACTGGTGGT 1440 

GCTCTCAGGG CAAAATCATG TTCCCCACCC CCGGGGCAGA ACCCCTCTTA GAAGCCTCTG 1500 

AGTCCCTCTG CAGAAGACCG GGCAGCAAAG CCTCCATCTG GAAGTCTGTC TGCCTTTGTT 156 0 

CCTTGAAGAA TGCAGCATTG TCTTTGTCTG TCCCCACCAC ATGGAGGTGG GGGTGGGATC 16 20 

. AATCTTAGGA AAAGCAAAAA AGGGTCCCAG ATCCCTTGGC CCTTTCCTCC GAAAAAAAAA 1680 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA ^'^^'^ 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

TTTCGAGGGT GGAGGTGAGC TAAGTCGCTA CGGTAGTCTC GCAAAGTCAC CCACAGGGTC 
GATGGACAGT CGCCGCACAA CCTGGCGCGG TTGGACGCGG TCTACTGAGC AGGGGCGACG 
CCCCAATGTC TATGGTTGTC AATACGCCGG ACCCGACTCT CCTAGTCACT GAACAAACGA 
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240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1717 



^r^nrr-r rcTTTTACTC CGCATTCTTT GCGAAACGTT TCGTTCCATT GTTTACCATG 
TCTGTGGCCT GGTTTTACTC CGCAii^ ^nrrmrTCG TAGACGGCCT CGGCCGTCAA 
TTCGTCGTGG AGAGGATGGC GGACCACTTG ACCGGACTC^ l^^TCOCGP. CCTCAAGACC 
GCCCCGCGGC ACGCGC^^^^ ^^Zl^, fcCG^GA JgAAGGTTCC CCTGGTGTTG 
CTCCGGGGTC GGTGTCCGGG GCGACTGTA^ cCCCCGCGGG ACCGCGTGCG GAAGGACGGC 
CTACCCGACC CGTTACGGAA ACTACCGG^^ l^ll^^^ GGGACTCGGC GGCGGCGCCC 

gcggcgccgc ttcgcgtgaa gctggttcta ctcgc^^ gcgaaccgga gtgggtgagc 

GCGTTGGACA AGCACCACGA ^^^^j^ ■ ^^^^^ ACC^^GCGCT GCGCGACGAG 
■ GGGCGCGGCG CGCGCGAGTA CCGCGGG^^^ J J^I^^cS icGGGGATCC CCCGAGTCAC 
TCGACCCTGC TGCACGACCG ^CACGTCTCG ^ACATAC^ gqaCCCTGAG GATGTCGGGG 
CGGCAGGTCG ^GGGTCCTTT CGACAAG^^^ ^^^JS TGAGAAGGAA GCTACGGTAG 
GTTCCTTCCG CGGGACTTTG CGTCCCGGGA "^^J CGGTAAAGAC CCTCCACCGT 

TGACATCTGT CCGTTGTCGT TGACATGTAA ^"^gj cTACCCAGCC CGACGGGGGG 
CGACTACCGT TGCAGAGTCT CGGGGCAGGT GACGTCCTTT J ^^^,^CCA 
TTGTAACTCC GACGCCGTCA CAGTAACTTA CTACCTCTAA ^G CACGGCCCGT 
CCTACGACCT CCAAGGCCCC ^^f lf^^ SgSgG GAGGAGACGC GGCGGAGTAG 
CCCCCGGACG GGGCGGTAGG ACTGCGGCGG JAGAA^ CTGACGTTCA CCTCGGGATG 

GAGAAGTTCC cacgggcgat gatgcacgac ^J^S^^^J tccagtcgcc GCGGGACGGC 

ATGGGGGCTT CAGACGT^CT GACCCCTCCG J^^^J^^^ ;c;,cCGCGGA GCTGGTCCGG 
TCCGGGCTAC CGAGGTAGTA GAAGAAGGCT "ACTG ^^^^^^^ CCCGACGACC 

TTTGACGTCC GTTGGTGGAG CCCGGCGACC CGGTGGC^^^ ^GTGGAGTCT TTGACCACCA 
GTACGGTTGA GCCCCTCGCG C^^ACAAGACT TCCGTG^^^^ ^^J^^AGAAT CTTCGGAGAC 
CGAGAGTCCC GTTTTAGTAC AAGGGGTGGG ^^CCCGT^^ cTTCAGACAG ACGGAAACAA 
TCAGGGAGAC GTCTTCTGGC CCGTCGTTTC ^J^^^J^J^ TACCTCCACC CCCACCCTAG 

^™ 

TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTT 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single ' . - 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
GTATCTCTAG ACACCATGTT TGCAAAGCAA GGTAACAAAT GGTACAAGC 
(2) INFORMATION FOR SEQ ID N0:4: . 

(i) SEQUENCE CHARACTERISTICS: _ 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
GTATCTCTAG ACACCATGAA GGTTCTGTGG ..TGCGTTGC TGGTCACATT CCTGGCAGGA SO 
TGCCAGGCCT TTGCAAAGCA AGGTAACAAA 'iGG 



49 
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{2) INFORMATION FOR SEQ ID NO: 5; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTACTTCTAG ACTACTTGTC ATCGTCGTCC TTGTAGTCAC CACCGAACAG GGCGCTCCCC 60 
GAGTTGGCAT GCC 73 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: • 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single* 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S: 

GTACTTCTAG AGATCTTCTT CACTGATCAG CTTCTGTTCA CCACCGAACA GGGCGCtCCC 60 
CGAGTTGGCA TGCC 74 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 • 10 . , 15 

xaa Xaa Xaa Leu Xaa Xaa. Leu Leu Leu Leu Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 ■ 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Leu Xaa Xaa Tyr Tyr Xaa Leu Xaa 

50 55 60 

Xaa xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 . 80 

Xaa Xaa Xaa Leu Xaa Xaa Met Gin Xaa Phe Phe Gly Leu Xaa Val Thr 

B5 90 S5 

Gly Lys Leu Asp Xaa Xaa Thr Leu Glu Xaa Met Xaa Lys Pro Arg Cys 
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100 










105 










110 






Gly Val 


Pro 


Asp 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Val 


Xaa 


Xaa 






115 










120 










125 








Phe 


Xaa 

130 


Leu 


Xaa 


Pro 


Xaa 


Xaa 
135 


Pro 


Lys 


Trp 


Xaa 


Xaa 
140 


Xaa 


Xaa 


Xaa 


Thr 


Tyr 


Arg 


lie 


Xaa 


Asn 


Tyr 


Thr 


Pro 


Asp 


Leu 


Xaa 


Xaa 


Xaa 


Xaa 


Val 


Asp 


145 








150 










155 




Val 






160 


Xaa 


Ala 


He 


Xaa 


Lys 
165 


Ala 


Phe 


Xaa 


Val 


Trp 
170 


Ser 


Xaa 


Thr 


Pro 
175 


Leu 


Xaa 


Phe 


Xaa 


Xaa 


Val 


xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Glu Gly 






180 










165 










190 






Xaa 


Ala 


Asp 


He 


Met 


He 


Xaa 


Phe Ala Xaa Xaa Glu His Gly Asp Xaa 




195 










200 










205 








Xaa 


Pro 


Phe 


Asp Gly 


Pro Gly Gly Xaa 


Leu 


Ala 


His 


Ala 


Phe 


Xaa 


Pro 




210 










215 










220 










Gly Pro Gly 


He Gly 


Gly Asp Ala 


His 


Phe Asp Asp Asp 


Glu 


Xaa 


Trp 


225 










230 










235 










240 


Thr 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 








245 










250 










255 




Xaa 


Xaa 


Xaa 


Xaa 
260 


Xaa 


Xaa 


Xaa 


Xaa 


xaa 

265 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
270 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
275 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
280 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
285 


Xaa 


Xaa 


Xaa 


xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 




290 










295 










300 










Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


305 










310 










315 










320 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
325 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
330 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
335 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
340 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
345 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
350 


Xaa 


Xaa 


Xaa 


Xaa 


xaa 
355 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
360 


Xaa 


Xaa 


Xaa 


Xaa 


•Xaa 
365 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa' Xaa 


Xaa 


Xaa 


Xaa 


Xaa 




370 










375 










380 










Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


385 










390 










395 










400 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
405 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
410 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
415 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Gly Xaa 


Asn 


Leu 


Phe 


Leu 


Val 


Ala 


Ala 


His 


Glu 








420 










425 










430 






Xaa Gly 


His 


Ser 


Leu 


Gly 


Leu 


Xaa 


His 


Ser 


Xaa Asp 


Pro 


Xaa 


Ala 


Leu 






435 










440 










445 








Met 


Tyr 


Pro 


Xaa Tyr 


Xaa 


Xaa 


Phe 


Xaa 


Asp 


Xaa 


Xaa 


Xaa 


Phe 


Xaa 


Leu 




450 










455 










460 










Xaa 


Xaa Asp 


Asp 


He 


Xaa Gly 


He 


Gin 


Xaa 


Leu Tyr Gly Xaa 


Xaa 


Xaa 


465 










470 










475 










480 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
485 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

490 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

495 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
500 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
505 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
510 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
515 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
520 


Xaa 


Xaa 




Xaa 


Xaa 
525 


Xaa 


Xaa 


Xaa 
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Xaa 


Xaa 


Xaa 


Xaa 


Xaa Xaa 


Xaa Xaa 


Pro Xaa Xaa Cys Asp Xaa Xaa Xaa 




530 








535 


540 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa Xaa 


Xaa Xaa 


Xaa Phe Asp Ala He Xaa Xaa Xaa 


545 








550 




555 560 


Arg 


Gly 


Glu 


Xaa 


Phe Phe 


Phe Lys Asp Arg Phe Phe Trp Arg Xaa Xaa 










565 




570 575 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa Xaa 


Xaa Xaa 


Xaa Xaa Xaa Leu lie Xaa Xaa Phe 








580 






585 590 


Trp 


Pro 


Xaa 


Leu 


Pro Xaa 


Xaa He Asp Ala Ala Tyr Glu Xaa Xaa Xaa 






595 






600 


605 


Xaa 


Xaa 


Xaa 


Val 


Phe Phe 


Phe Lys Gly Xaa Xaa Tyr Trp Xaa Tyr Xaa 




610 








615 


620 


Gly 


Xaa 


Xaa 


Xaa 


Xaa Xaa Gly Tyr 


Pro Xaa Xaa He Xaa Xaa Xaa Leu 


625 








630 




€35 640 


Gly 


Phe 


Pro 


Xaa 


Xaa Val 


Xaa Xaa 


He Asp Ala Ala Xaa Xaa Xaa Xaa 










645 




. 650 655 


Xaa 


Xaa 


Xaa 


Xaa 


Lys Thr Tyr Phe 


Phe Xaa Xaa Xaa Xaa Tyr Trp Arg 








660 






665 670 


Tyr 


Asp 


Glu 


Xaa 


Xaa Xaa 


Xaa Met 


Asp Pro Gly Tyr Pro Lys Xaa He 






675 






680 


665 


Xaa 


Xaa 


Xaa 


Phe 


Xaa Gly 


He Xaa 


xaa Xaa Val Asp Ala Val Phe Xaa 




690 








695 


700 


Xaa 


Xaa 


Xaa 


Xaa 


Gly Phe Xaa Tyr 


Phe Phe Xaa Gly Xaa Xaa Xaa Tyr 


705 








710 




715 720 


Xaa 


Phe 


Xaa 


Xaa 


Xaa Xaa 


Xaa Xaa 


Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



725 730 735 



Xaa Xaa Xaa Xaa Trp Leu Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

740 745 750 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

755 760 765 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

770 .775 780 * 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
785 790 795 800 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305' '810 815 

Xaa Xaa Xaa Xaa 
820 

{2} INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: ' 



Pro Arg Cys Gly Val Thr Asp 

1 ' ' 5 ' 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Arg Lys Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 46 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 

Lys Ala Pro Thr Ser Thr Arg Phe Ser Asp Ala lie Arg Ala Phe Gin 

1 5 10 • . 15 

Trp Val Ser Gin Leu Pro Val Ser Gly Val Leu Asp Arg Ala Asn Leu 

20 " ^° . ^ 

Arg Gin Mec Thr Arg Pro Arg Cys Gly Val Thr Asp Thr Asn Ser Tyr 

35 ' 40 . *5 

Ala Ala Trp Ala Glu Arg He Ser Asp Leu Phe Ala Arg His Arg Thr 

50 55 60 

Lys Met Arg Arg Lys Lys Arg Phe Ala Lys Gin Gly Asn Lys Trp Tyr 
65 70 75 BO 

Lys Gin His Leu Ser Tyr Arg Leu Val Asn Trp Pro Glu His Leu Pro 

85 90 95 

Glu Pro Ala val Arg Gly Ala Val Arg Ala Ala Phe Gin Leu Trp Ser 

100 105 
Asn val Ser Ala Leu ciu Phe Trp Glu Ala Pro Ala Thr Gly Pro Ala 

115 .120 125 

ASP He Arg Leu Thr Phe Phe Gin Gly Asp His Asn Asp Gly Leu Gly 

130 135 • 140 

Asn Ala Phe Asp Gly Pro Gly Gly Ala Leu Ala His Ala Phe Leu Pro 
145 150 155 1«° 

Arg Arg Gly Glu Ala His Phe Asp Gin Asp Glu Arg Trp Ser Leu Ser 

165 170 175 

Arg Arg Arg Gly Arg Asn Leu Phe Val Val Leu Ala His Glu He Gly 

180 185 190 

His Thr Leu Gly Leu Thr His Ser Pro Ala Pro Arg Ala Leu Met Ala 

200 205 



195 
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Pro 


Tyr Tyr 


Lys 


Arg Leu Gly Arg Asp Ala Leu Leu 


Ser Trp Asp 


Asp 




210 






215 220 










val 


Leu 


Ala 


Val 


Gin Ser Leu Tyr Gly Lys Pro Leu 


Gly Gly Ser 


Val 


225 








230 235 








240 


Ala 


Val 


Gin 


Leu 


Pro Gly Lys Leu Phe Thr Asp Phe 


Glu 


Thr 


Trp 


Asp 










245 250 






255 




Ser 


Tyr 


Ser 


Pro 


Gin Gly Arg Arg Pro Glu Thr Gin 


Gly 


Pro 


Lys 


Tyr 








260 


265 




270 






Cys 


His 


Ser 


Ser 


Phe Asp Ala He Thr Val Asp Arg 


Gin 


Gin 


Gin 


Leu 






275 




280 


285 








Tyr 


lie 


Phe 


Lys 


Gly Ser His Phe Trp Glu Val Ala 


Ala Asp Gly 


Asn 




290 






295 . 300 










Val 


Ser 


Glu 


Pro 


Arg Pro Leu Gin Glu Arg Trp Val 


Gly Leu 


Pro 


Pro 


305 








310 315 








320 


Asn 


He 


Glu 


Ala 


Ala Ala Val Ser Leu Asn Asp Gly 


Asp 


Phe 


Tyr 


Phe 










325 330 






335 




Phe 


Lys 


Gly Gly 


Arg Cys Trp Arg Phe Arg Gly Pro 


Lys 


Pro 


Val 


Trp 








340 


345 




350 






Gly 


Leu 


Pro 


Gin 


Leu Cys Arg Ala Gly Gly Leu Pro 


Arg 


His 


Pro 


Asp 






355 




360 


365 








Ala 


Ala 


Leu 


Phe 


Phe Pro Pro Leu Arg Arg Leu He' Leu 


Phe 


Lys 


Gly 




370 






375 380 










Ala 


Arg Tyr , Tyr 


Val Leu Ala Arg Gly Gly Leu. Gin Val 


Glu 


Pro 


Tyr 


385 








390 395 








400 


Tyr 


Pro Arg 


Ser 


Leu Gin Asp Trp Gly Gly lie Pro 


Glu 


Glu 


Val 


Ser 










405 410 






415 




Gly 


Ala 


Leu 


Pro 


Arg Pro Asp Gly Ser He He Phe 


Phe 


Arg Asp 


Asp 








420 


425 




430 






Arg 


Tyr Trp Arg 


Leu Asp Gin Ala Lys Leu Gin Ala 


Thr 


Thr 


Ser 


Gly 






435 




440 


445 








Arg 


Trp 


Ala 


Thr 


Glu Leu Pro Trp Met Gly Cys Trp His Ala Asn 


Ser 




450 






455 460 










"Gly 


Ser 


Ala 


Leu 


Phe 











465 



(2) INFORMATION FOR SEO ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

• (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Arg Cys Gly Val Pro Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

'Ill SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
His Glu Xaa GXy His Xaa Leu Gly Leu Xaa His Ser 
1 5 ' 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

. (c) STRANDEDNESS: s'^ingle 
..(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l3: 

Gly Glu Gly Pro Leu Gly Leu Tyr Ala Lys Gly 
^ 5 ^ ^° 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino, acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

Arg His Arg Thr Lys Met Arg Arg Lys Lys Arg Phe Ala Lys Gin Gly 

1 5 . 10 

Asn 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



wo 98/40475 PCT/US98/04694 

64 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 15: 

Phe Gin Gly Asp His Asn Asp Gly Leu Gly Asn Ala Phe Asp Gly 
1 5 10 • 15 

(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 25 amino acids . . 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Arg Ser Leu Gin Asp Trp Gly Gly He Pro Glu Glu Val Ser Gly Ala 

1 5 10 15 

Leu Pro Arg Pro Asp Gly Ser He He 
20 25 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Thr Glu Leu Pro Trp Met Gly Cys Trp His Ala Asn Ser Gly Ser 

1 , 5 10 15 

Ala Leu Phe 



(2) 'INFOR24ATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SOB amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asn Cys Gin Gin Leu Trp Leu Gly Phe Leu Leu Pro Met Thr Val 
1 5 10 15 
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Ser Gly Arg Val Leu Gly Leu Ala Glu Val Ala Pro Val Asp Tyr Leu 

20 25 30 

Ser Gin Tyr Gly Tyr Leu Gin Lys Pro Leu Glu Gly. Ser Asn Asn Phe 

35 40 45 

Lys Pro Glu Asp He Thr Glu Ala Leu Arg Ala Phe Gin Glu Ala Ser 

SO 5.5 60 

Glu Leu Pro Val Ser Gly Gin Leu Asp Asp Ala Thr Arg Ala Arg Met 
65 70 75 80 

Arg Gin Pro Arg Cya Gly Leu Glu Asp Pro Phe Asn Gin Lys Thr Leu 

85 90 95 

Lys Tyr Leu Leu Leu Gly Arg Trp Arg Lys Lys His Leu Thr Phe Arg 

100 lOS 110 

He Leu Asn Leu Pro Ser Thr Leu Pro Pro His Thr .Ala Arg Ala Ala 

115 120 125 

Leu Arg Gin Ala Phe Gin Asp Trp Ser Asn Val Ala Pro Leu" Thr Phe 

130 135 140 

Gin Glu Val Gin Ala Gly Ala Ala Asp He Arg Leu Ser Phe His Gly 
145 150 155 160 

Arg Gin Ser Ser Tyr Cys Ser Asn Thr Phe Asp Gly Pro Gly Arg Val 

165 170 175 

Leu Ala His Ala Asp He Pro Glu Leu Gly Ser Val His Phe Asp Glu 

160 185 190 

Asp Glu Phe Trp Thr Glu Gly Thr Tyr Arg Gly Val Asn Leu Arg lie 

195 200 205 

He Ala Ala His Glu Val Gly His Ala Leu Gly Leu Gly His Ser Arg 

210 215 220 

Tyr Ser Gin Ala Leu Met Ala Pro Val Tyr Glu Gly Tyr Arg Pro His 
225 230 235 240 

Phe Lys Leu His Pro Asp Asp Val Ala Gly He Gin Ala Leu Tyr Gly 

245 250 255 

Lys Lys Ser Pro Val He Arg Asp Glu Glu Glu Glu Glu Thr Glu Leu 

260 265 270 

Pro Thr Val Pro Pro Val Pro Thr Glu Pro Ser Pro Met Pro Asp Pro 

275 280 : 285 

Cys Ser Ser Glu Leu Asp Ala Met Met Leu Gly Pro Arg Gly Lys Thr 

290 295 300 

Tyr Ala Phe Lys Gly Asp Tyr Val Trp Thr Val Ser Asp Ser Gly Pro 
305 310 315 320 

Gly Pro Leu Phe Arg Val Ser Ala Leu Trp Glu Gly Leu Pro Gly Asn 

325 330 335 

Leu Asp Ala Ala Val Tyr Ser Pro Arg Thr Gin Trp He His Phe Phe 

340 345 350 

Lys Gly Asp Lys Val Trp Arg Tyr He Asn Phe Lys Met Ser Pro Gly 

355 ' 360 365 

Phe Pro Lys Lys Leu Asn Arg Ser Glu Pro Asn Leu Asp Ala Ala Leu 

370 375 380 

Tyr Trp Pro Leu Asn Gin Lys Val Phe Leu Phe Lys Gly Ser Gly Tyr 
385 390 395 ^ 400 

Trp Gin Trp Asp Glu Leu Ala Arg Thr Asp Phe Ser Ser Tyr Pro Lys 

405 410 415 

Pro He Lys Gly Leu Phe Thr Gly Val Pro Asn Gin Pro Ser Ala Ala 

420 425 430 

Met Ser Trp Gin Asp Gly Arg Val. Xyr Phe Phe Lys Gly Lys Val Tyr 
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445 



Trp Arg heu Asn Gin Gin Leu Irg Val Glu Lys Gly Tyr Pro Arg Aan 
lie Ser His Asn Trp Met ^i'i! Cys Arg Pro Arg Thr He Asp Thr Thr ' 

^•vn 475 lOU 



III ser Gly Gly Asn tH Thr, Pro Ser Gly Thr Gly He Thr Leu Asp 
485- 

Thr Thr Leu Ser Ala Thr Glu Thr Thr Phe Glu Tyr 
500 505 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 

Met Glu Ala Leu Met Ala Arg Gly Ala Leu Thr Gly Pro Leu Arg Ala 

■ Leu Cys Leu Leu Gly Cys Leu Leu Ser His Ala Ala Ala Ala Pro Ser 
20 

pro lie lie Lys Phe Pro Gly Asp val Ala Pro Lys Thr Asp Lys Glu 

Leu Ala "l Gin Tyr Leu Asn iSr Phe Tyr Gly Cys Pro Lys Glu Ser 

cys Tsn Leu Phe Val Leu Lys Asp Thr Leu Lys Lys Met Gin Lys Phe 

IL Gly Leu pro Gin llr Gly Asp Leu Asp Gin Asn Thr lie 'Glu Thr 
85 

Met Arg Lys Pro Arg Cys Gly Asn, Pro Asp Val Ala Asn Tyr Asn Phe 

100 

Phe Pro Arg Lys Pro Lys Trp Asp Lys Asn Gin He Thr Tyr Arg He 
lie Gly iyr Thr Pro Asp Leu IJp Pro Glu Thr Val Asp Asp Ala Phe 
Ala irg Ala Phe Gin Val ^rp Ser Asp Val Thr Pro Leu Arg Phe Ser 
irg lie His ASP Gly Glu Ala Asp He Met ul Asn Phe Gly Arg Trp 
Glu His Gly ASP g" Tyr Pro Phe Asp Gly Lys. Asp Gly Leu Leu Ala 

185 i. I? V 

His Ala Phe Ala Pro Gly Thr Gly Val Gly Gly Asp Ser His Phe Asp 

195 200 205 

;.sp ASP Glu Leu Trp Thr Leu Gly Glu Gly Gin Val Val Arg Val Lys 

Tyr III Asn Ala Asp Gly III Tyr Cys Lys Phe Pro .Ph. Leu Phe Asn 

Gly Lys Glu Tyr Asn Ser Cys Thr Asp Thr Gly Arg Ser Asp Gly Phe 
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.eu Trp Cys Ser Thr Thr Tyr Asn Phe "^Ly. Asp GXy Lys "r Oly 
P.e cys pro "s Glu AXa Leu Phe Thr Met Gly Gly Asn Ma Glu Gly 
cm pro C^s .ys Phe Pro Phe Vrl Phe Gin Gly Thr Ser Tyr Asp Ser' ■ 
cys Thr Glu Gly Ar^ lH Asp CXy Tyr Ars -p Cys Gly Thr Thr 
^lu ASP Tyr Asp Ar. Z Lys .ys Tyr Gly Phe Cys Pro Glu Thr Ala 

- r-i.. rl« Ala Pro Cys Val Phe Pro 
Met ser Thr Val Gly Gly Asn ser Glu Gly Ala Pro 

Pne Thr Ph, Z Gly As. Lys Tyr Glu Ser Cys Thr Ser Ala Gly Ar. 
ser ASP Gly .ys Met Trp Cys Ala Thr Thr Ala Asn Tyr Asp Asp Asp 

3''° rln Glv Tyr Ser Leu Phe Leu Val 

Arg Lys Trp Gly Phe Cys Pro Asp Gin Gly Ty^ 

^ IV u<o Via Met Gly Leu Glu His Ser Gin Asp 
Ala Ala His Glu Phe Gly Hxs Ala Met Gly b 

, ^ Ti- T«T- Thr Tvr Thr Lys Asn Phe Arg 
pro Gly Ala Leu Met Ala Pro He Tyr Thr Tyr ^ 

.eu ser Cln Z Asp XI. Gly Xle Cln Cl. Leu Tyr CXy AXa Ser 
P.O ASP ri: ASP Leu Gly Thr dly Pro Thr Pro Thr Leu CXy Pro vaX 
.Hr Z Glu lie Cys Lys c!n Asp Xle Val Phe Asp Gly He Ala Gin- 
;;Urg Gly Glu Xle Jh: Phe Phe Lys ASP Arg Phe lie Trp Arg Thr . 

r. M-f r\v Pro Leu Leu Val Ala Thr Phe 
val Thr Pro Arg Asp Lys Pro Met Gly Pro Leu u 

.rp pro CXu l" Pro OXu Lys XXe Lp AXa Val Tyr CXu AXa Pro GXn 
Clu GXu Lys AXa Val Phe Phe OXy Asn GXu Tyr Trp XXe Tyr Ser 

^, »r« III Tvr Pro Lys Pro Leu Thr Ser Leu Gly 
Ala Ser Thr Leu Glu Arg Gly Tyr pro uy 

Te! pro pro Asp Val Gin Arg Val Asp Ala Ala Phe Asn Trp Ser Lys 
Asn Lys Lys Thr Tyr XXe Phe AXa GXy Asp Lys Phe Trp Arg Tyr Asn 
CXU VaX Lys Ly^ Lys Met Asp Pro GXy Phe Pro Lys .Leu XXe AXa Asp • 

cac 600 , 

, ala Val Val >sp Leu Gin 

AXa Trp Asn AXa iXe Pro Asp Asn Leu Asp AXa VaX A p 

CXy III CXy His ser Tyr P^^ Phe Lys GXy AXa Tyr Tyr Leu Lys Leu 

"! Asn CXn ser Leu Ly^s Ser VaX Lys Phe GXy Ser XXe Lys Ser Asp 



645 

Trp Leu Gly Cys 
660 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
Met Ser-Leu Trp Gin Pro Leu Val Leu Val Leu Leu Val Leu Gly Cys 
cys Phe Ala Ala Pro Arg Gin" Arg Gin Ser Thr Leu Val Leu Phe Pro 
Gly ASP Leu Irg Thr Asn Leu Thr Hp Arg Gin Leu Ala Glu Glu Tyr 
Leu Tyr Arg Tyr Gly Tyr Thr Irg Val Ala Glu Met Arg Gly Glu Ser 



55 



Lys ser Leu Gly Pro Ala Leu Leu Leu Leu Gin Lys Gin Leu Ser Leu 

Glu Thr Gly Glu Leu Asp Serbia Thr Leu Lys Ala Met Arg Thr 
95 '° 
Pro Arg Cys Gly Val Pro Asp Leu Gly Arg Phe Gin Thr Phe Glu Gly 

100 

ASP Leu Lys Trp His His. His Asn lie Thr Tyr Trp lie Gin Asn Tyr 

115 120 
ser Glu ASP Leu Pro Arg Ala Val lie Asp Asp Ala Phe Ala Arg Ala 

Phe "la Leu Trp Ser Ala III Thr Pro Leu Thr Phe Thr Arg Val Tyr 

III Arg ASP Ala Asp Ue Val He Gin Phe Gly Val Ala Glu His Gly 

165 ^"^^ 
Asp Gly Tyr Pro Phe Asp Gly Lys Asp Gly Leu Leu Ala His Ala Phe 

Pro Pro Gly Pro Gly He Gin Gly Asp Ala His Phe Asp Asp Asp Glu 

195 200 205 

Leu Trp ser Leu Gly Lys Gly Val Val Val Pro Thr Arg Phe Gly Asn 

Ala ASP Gly Ala Ala Cys His Phe Pro Phe lie Phe Glu Gly Arg Se.r 

Tyr ser Ala Cys Thr Thr Asp Gly Arg Ser Asp Gly Leu Pro Trp Cys 
' 250 255 



ser Thr Thr Ala Asn Tyr Asp Thr Asp Asp Arg Phe Gly Phe Cys Pro 

260 265 270 

ser Glu Arg Leu Tyr Thr Arg Asp Gly Asn Ala Asp Gly Lys Pro Cys 

Gin Phe Pro Phe He Phe Gin Gly Gin Ser Tyr Ser Ala Cys Thr Thr 

290 295 
ASP Gly P.ra Ser Asp Gly Tyr Arg Trp Cys Ala Thr Thr Ala Asn Tyr 

Isp Arg ASP Lys Leu Phe Gly Phe Cys Pro Thr Arg Ala Asp Ser Thr 
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vai »« oiy "° "° III 

p^, L.U =ly W Tyr s.r Thr Cy. Thr s.. «» oly Ar, oly »=p 

«y llu T,p CVS M. T»r T«r s.r P« ^^P "P ■ ' 

^ Z cy, »o ASP =xy Ty. s„ «u P« U.. VU AI. AU 

;n =iu p.e «y »i. <=iv - A.P :n °"' ■ 

„. HS. Tyr "o »ec Ty. A„ Jh. T« =.u «y "l- 

.y. „p ASP 'v.; A.„ Oly °" 

P„ «a "I A„ pro T.r m Thr THr Pro cln Pro T»r Al. Pro 

Pro4« v.. cy. Pro T»r "y Pro Pro T-r ..J Kl. Pro S.r cl„ Ar, 

Vrl T»r Al. Cly »o =>y Pro Pro s.r AU Cly Pro T^r =ly Pro 

„o T.r Al. Oly s.r T-r Al. TJr TH, v.l Pro L.u S.r Pro v.l 

„p ..p Al. A.O V.1 A.„ U. P.. ASP Al. «• Al. =1. H. Oly 

„„ 01. Tyr PH. Ly. Asp oly .y. Tyr Trp Ar, Ph. S.r CU 

"° . o^^ rin Glv Pro Phe Leu He Ala Asp Lys Trp 

Gly Arg Gly Ser Arg Pro Gin Gly Pro t-ne , 

To Ala Leu pro Arg ""s Leu Asp Ser Val Phe Glu Clu Pro Leu-Ser 

Lys Leu Phe p" Phe Ser Gly Arg Cln Val Trp Val Tyr Thr Gly 

Ma ser Val l" Gly Pro Arg Arg Leu Asp Lys Leu Oly Leu Gly Ala 

cQc 600 
.sp val AU Gin val Thr Gly Ma Leu Arg Ser Gly Arg Gly Lys Me. 

.eu Z Phe ser Gly Arg Irg Leu Trp Arg Phe Asp Val Lys Ala Gin 

c-,. Clu Val ASP Arg Met Phe Pro Gly 
• Met Val Asp Pro Arg Ser Ala Ser Glu Vai Asp g 655 

£45 obU 

val pro Leu Asp Thr His Asp Val Phe Gin Tyr Arg Glu Lys Ala Tyr 

Phe Cys Gin Z Arg Phe Tyr Trp "g Val Ser Ser Arg Ser Glu Leu 

680 

.sn Gin III ASP Gin Val Gly Tyr Val Thr Tyr Asp Xle Leu Gin Cys 



690 
Pro Glu Asp 
705 



(21 INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 477 amino acidS 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION :^ SEQ ID N0:21: 



Met Lys Ser Leu Pro 


He 


Leu Leu Leu Leu Cys Val 


Ala Val 


Cys 


Ser 


1 5 




10 




15 




Ala Tyr Pro Leu Asp Gly Ala Ala Arg Gly Glu Asp 


Thr Ser 


Met 


Asn 


20 




25 


30 






Leu Val Gin Lys Tyr 


Leu 


Glu Asn Tyr Tyr Asp Leu 


Lys Lys 


Asp 


Val 


35 




40 


45 






Lys Gin Phe Val Arg Arg 


Lys Asp Ser Gly Pro Val 


Val Lys 


Lys 


He 


50 




55 60 








Arg Glu Met Gin Lys 


Phe 


Leu Gly Leu Glu Val Thr 


Gly Lys Leu 


Asp 


65 


70 


75 






80 


Ser Asp Thr Leu Glu 


Val 


Met Arg Lys Pro Arg Cys 


Gly Val 


Pro 


Asp 


85 




90 




95 




Val Gly His Phe Arg Thr 


Phe Pro Gly He Pro Lys 


Trp Arg 


Lys 


Thr 


100 




105 


110 






His Leu Thr Tyr Arg 


He 


Val Asn Tyr Thr Pro Asp 


Leu Pro 


Lys 


Asp 


115 




120 


125 






Ala Val Asp Ser Ala 


Val 


Glu Lys Ala Leu Lys Val 


Trp Glu Glu 


Val 


130 




135 140 








Thr Pro Leu Thr Phe Ser Arg Leu Tyr Glu Gly Glu 


Ala Asp 


He 


Met 


145 


150 


155 






160 


lie Ser Phe Ala Val 


Arg 


Glu. His Gly Asp Phe Tyr 


Pro Phe 


Asp 


Gly 


165' 




170 




175 




Pro Gly Asn Val Leu 


Ala 


His Ala. Tyr Ala Pro Gly 


Pro Gly 


He 


Asn 


180 




185 


190 






Gly Asp Ala His Phe 


Asp 


Asp Asp Glu Gin Trp Thr 


Lys Asp 


Thr 


Thr 


195 




200. 


205 






Gly Thr Asn Leu Phe 


Leu 


Val Ala Ala His Glu He 


Gly His 


Ser 


Leu 


210 




215 220 








Gly Leu Phe His Ser 


Ala 


Asn Thr Glu Ala Leu Met 


Tyr Pro 


Leu 


Tyr 


225 


230 


235 






240 


His Ser Leu Thr Asp 


Leu Thr Arg Phe Arg Leu Ser 


Gin Asp Asp 


He 


245 




250 




255 




Asn Gly lie Gin Ser 


Leu Tyr Gly Pro Pro Pro Asp 


Ser Pro 


Glu 


Thr 


260 




265 


270 






Pro Leu Val Pro Thr 


Glu 


Pro Val Pro Pro Glu Pro 


Gly Thr 


Pro 


Ala 


275 




280 


285 






Asn Cys Asp Pro Ala 


Leu 


Ser Phe Asp Ala Val Ser 


Thr Leu 


Arg 


Gly 


290 




295 300 








Glu lie Leu He Phe 


Lys 


Asp Arg His Phe Trp Arg 


Lys Ser 


Leu 


Arg 


305 


310 


315 






320 


Lys Leu Glu Pro Glu 


Leu 


His Leu He Ser Ser Phe 


Trp Pro 


Ser 


Leu 


325 




330 




335 




Pro Ser Gly Val Asp 


Ala Ala Tyr Glu Val Thr Ser 


Lys Asp Leu 


Val 


340 




345 


350 
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Phe lie Phe Lys Gly Asn Gin Phe Trp Ala lie Arg Gly Asn Glu Val 

Xrg Ala Gly Tyr Pro Arg Gly III His Thr Leu Gly Phe Pro Pro Thr 

375 380 



355 

val Trg Lys He Asp- Ala 111 He Ser Asp Lys Glu Lys Asn Lys Thr 
390 395 



Ty'r Phe Phe Val Glu Zl Lys Tyr Trp Arg Phe Asp Glu Lys Arg Asn 
ser Met Glu Pro Gly Phe Pro Lys Gin Til Ala Glu Asp Phe Pro Gly 
lie ASP ser "s He Asp Ala Val Ph! Glu Glu Phe Gly Phe Phe Tyr 



Phe Phe iir Gly Ser Ser Gin Glu Phe Asp Pro Asn Ala Lys Lys 



440 



val Jhr His Thr Leu Lys S^r Asn Ser Trp Leu Asn Cys 



4S5 



460 



465 



470 



475 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Met His Leu Ala Phe Leu Val Leu Leu Cys Leu Pro Val Cys Ser 

aL Tyr pro Leu Ser Gly Ala Ala Lys gL Glu Asp Ser Asn Lys Asp 

Leu Ala Gin lln Tyr Leu Glu Lys Tyr Tyr Asn Leu Glu Lys -Asp Val 

Lys Gin Phe Arg Arg Lys Asp Ser Asn Leu He Val Lys Lys lie Gin 

Gly Met Gin Lys Phe Leu Gly Leu Glu Val Thr Gly Lys Leu Asp Thr 

Isp Thr Leu Glu Val Met Arg Lys Pro Arg Cys Gly Val Pro Asp Val. 

Gly His Phe ser Ser Phe Pro Gly Met Pro Lys Trp Arg Lys Thr His 
100 

Leu Thr Tyr Arg He Val Asn Tyr Thr Pro Asp Leu Pro Arg Asp Ala 

120 ^25 



val ASP ser Ala He Glu Lys Ala Leu Lys Val Trp Glu Glu Val Thr 

pro Leu Thr Phe Ser Arg Leu Tyr Glu Gly Glu Ala Asp He Met He 

III Phe Ala val Lys llu His Gly Asp Phe Tyr Ser Phe Asp Gly Pro 
165 ^''O 



Gly His ser Leu Ala His Ala Tyr Pro Pro Gly Pro Gly Leu Tyr Gly 
180 
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Asp He His Phe Asp Asp Asp Glu Lys Trp Thr Glu Asp Ala Ser Gly 

195 200 205 

Thr Asn Leu Phe Leu Val Ala Ala His Glu Leu Gly His Ser Leu Gly 

210 215 220 

Leu Phe His ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr Asn 
225 230 .240 

ser Phe Thr Glu Leu Ala Gin Phe Arg Leu Ser Gin Asp Asp Val Asn 

245 250 255 

Gly lie Gin Ser Leu Tyr Gly Pro Pro Pro Ala Ser. Thr Glu Glu Pro 

260 265 270 

Leu val Pro Thr Lys Ser Val Pro Ser- Gly Ser Glu Met Pro Ala Lys 

275 280 285 

CVS ASP pro Ala Leu Ser Phe Asp Ala lie Ser Thr Leu Arg Gly Glu 

290 295 
Tyr Leu Phe Phe Lys Asp Arg Tyr Phe Trp Arg Arg Ser His Trp Asn 
305 310 315 

pro Glu Pro Glu Phe His Leu He Ser Ala Phe Trp. Pro Ser Leu Pro 

325 330 33S 

ser Tyr Leu Asp Ala Ala Tyr Glu Val Asn Ser Arg Asp Thr Val Phe 

340 345 350 

lie Phe Lys Gly Asn Glu Phe Trp Ala lie Arg Gly Asn Glu Val Gin 

355 360 365 

Ala Gly Tyr Pro Arg Gly He His Thr Leu Gly Phe Pro Pro Thr He 

370 375 380 

Arg Lys He Asp Ala Ala Val Ser Asp Lys Glu Lys Lys Lys Thr Tyr 
385 390 395 400 

Phe Phe Ala Ala Asp Lys Tyr Trp Arg Phe Asp Glu Asn Ser Gin Ser 

405 410 415 

Met Glu Gin Gly Phe Pro Arg Leu He Ala Asp Asp Phe Pro Gly Val 

420 425 430 

Glu Pro Lys Val Asp Ala Val Leu Gin Ala Phe Gly Phe Phe Tyr Phe 

435 440 445 

Phe Ser Gly Ser Ser Gin Phe Glu Phe Asp Pro Asn Ala Arg Met Val 

450 455 460 

Thr His He Leu Lys Ser Asn Ser Trp Leu His Cys 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 23: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 469 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDMESS; single- 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met His Ser Phe Pro Pro Leu Leu Leu Leu Leu Phe Trp Gly Val Val 

1 5 . " ^5 . 

Ser His Ser Phe Pro Ala Thr Leu Glu Thr Gin Glu Gin Asp Val Asp 

20' 25 
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Leu Val Gin Lys Tyr Leu 
35 



Glu Lys Tyr Tyr Asn Leu Lys 
40 45 



Asn Asp Gly 



. Arg Gin Val Glu Lys Arg Arg Asn Ser Gly Pro Val Val Glu Lys Leu 
50 55 60 

Lys Gin Met Gin Glu Phe Phe Gly Leu Lys Val Thr Gly Lys Pro Asp 
65 70 75 BO 

Ala Glu Thr Leu Lys Val Met Lys Gin Pro Arg Cys Gly Val Pro Asp 

85 90 95 

Val Ala Gin Phe Val Leu Thr Glu Gly Asn Pro Arg Trp Glu Gin Thr 

100 105 110 

His Leu Thr Tyr Arg lie Glu Asn Tyr thr Pro Asp Leu Pro Arg Ala 

115 120 125 

Asp Val Asp His Ala He Glu Lys Ala Phe Gin Leu Trp Ser Asn Val 

130 135 140 

Thr Pro Leu Thr Phe Thr Lys Val Ser Glu Gly Gin Ala Asp He Met 
145 150 155 160 

lie Ser Phe Val Arg Gly Asp His Arg Asp Asn Ser Pro Phe Asp Gly 

165 170 175 

Pro Gly Gly Asn Leu Ala His Ala Phe Gin Pro Gly Pro Gly He Gly 

180 185 190 

Gly Asp Ala His Phe Asp Glu Asp Glu Arg Trp Thr Asn Asn Phe Arg 

195 200 205 

Glu Tyr Asn Leu His Arg Val Ala Ala His Glu Leu Gly His Ser Leu 

210 215 220 

Gly Leu Ser His Ser Thr Asp He Gly Ala Leu Met Tyr Pro Ser Tyr 
225 230 235 240 

Thr Phe Ser Gly Asp Val Gin Leu Ala Gin Asp Asp He Asp Gly He 

245 250 ^ 255 

Gin Ala He Tyr Gly Arg Ser Gin Asn Pro Val Gin Pro He Gly Pro 

260 265 270 

Gin Thr Pro Lys Ala Cys Asp Ser Lys Leu Thr Phe Asp Ala He Thr 

275 280 285 

Thr He Arg Gly Glu Val Met Phe Phe Lys Asp Arg Phe Tyr Met Arg 

290 295 300 

Thr Asn Pro Phe Tyr Pro Glu Val Glu Leu Asn Phe He Ser Val Phe 
305 310 315 320 

Trp Pro Gin Leu Pro Asn Gly Leu Glu Ala Ala Tyr Glu Phe Ala Asp 

325 330 335 

Arg Asp Glu Val Arg Phe Phe Lys Gly Asn Lys Tyr Trp Ala Val Gin 

340 345 350 

Gly Gin Asn Val Leu His Gly Tyr Pro Lys Asp He Tyr Ser Ser Phe 

355 360 365 

Gly Phe Pro Arg Thr Val. Lys His He Asp Ala Ala Leu Ser Glu Glu 

370 375 380 

Asn thr Gly Lys Thr Tyr Phe Phe Val Ala Asn Lys Tyr Trp Arg Tyr 
385 390 395 400 

Asp Glu Tyr Lys Arg Ser Met Asp Pro Gly Tyr Pro Lys Met He Ala 

405 410 415 

His Asp Phe Pro Gly He Gly His Lys Val Asp Ala Val Phe Met Lys 

420 425 430 

Asp Gly Phe Phe Tyr Phe Phe His Gly Thr Arg Gin Tyr Lys Phe Asp 

435 440 445 

Pro Lys Thr Lys Arg He Leu Thr Leu Gin Lys Ala Asn Ser Trp Phe 
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450 455 • 460 

Asn Cys Arg Lys Asn 
465 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 4 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Phe Ser Leu Lys Thr Leu Pro Phe Leu Leu Leu. Leu His Val Gin 

1 5 10 15 

lie Ser Lys Ala Phe Pro Val Ser Ser Lys Glu Lys Asn Thr Lys Thr 

20 25 30 

Val Gin Asp Tyr Leu Glu Lys Phe Tyr Gin Leu Pro Ser Asn Gin Tyr 

35 40 45 

Gin Ser Thr Arg Lys Asn Gly Thr Asn Val lie Val Glu Lys Leu Lys 

50 55 60 

Glu Met Gin Arg Phe Phe Gly Leu Asn Val Thr Gly Lys Pro Asn Glu 
65 70 75 80 

Glu Thr Leu Asp Met Met Lys Lys Pro Arg Cys Gly Val Pro Asp Ser 

85 90 95 

Gly Gly Phe Met Leu Thr Pro Gly Asn Pro Lys Trp Glu Arg Thr Asn 

100 105 110 

Leu Thr Tyr Arg lie Arg Asn Tyr Thr Pro Gin Leu Ser Glu Ala Glu 

115 120 125 . 

Val Glu Arg Ala He Lys Asp Ala Phe Glu Leu Trp Ser Val Ala Ser 

130 '135 140 

Pro Leu He Phe Thr Arg He Ser Gin Gly Glu Ala Asp He Asn He 
145 150 155 160 

Ala Phe Tyr Gin Arg Asp His Gly Asp Asn Ser Pro Phe Asp Gly Pro 

165 170 175 

Asn Gly He Leu Ala His Ala Phe Gin Pro Gly Gin Gly He Gly Gly 

180 185 190 

Asp Ala His Phe Asp Ala Glu Glu Thr Trp Thr Asn Thr Ser Ala Asn 

195 200 205 

Tyr Asn Leu Phe Leu Val Ala Ala His Glu Phe Gly His Ser Leu Gly 

210 215 220 

Leu Ala His Ser Ser Asp Pro Gly Ala Leu MeC Tyr Pro Asn Tyr Ala 
225 ^ 230 235 240 

Phe Arg Glu Thr Ser Asn Tyr Ser Leu Pro Gin Asp Asp He Asp Gly 

245 250 255 

He Gin Ala He Tyr Gly Leu Ser Ser Asn Pro He Gin Pro Thr Gly 

260 26S 270 

Pro Ser Thr Pro Lys Pro Cys Asp Pro Ser Leu Thr Phe Asp Ala He 

275 280 285 

Thr Thr Leu Arg Gly Glu He Leu Phe Phe Lys Asp Arg Tyr Phe Trp 





Val 


Glu Met 


Asn 


PH^ Tla Cai^ T.fitl 
rilCS Q6£^ wCVA 






315 




J A U 


Gly 


He 


Gin Ala 


Ala 


Tvr Glvi A.sn Phe 






330 




J J 3 


Phe 


Lys Gly Asn 




Tvr Tt»t» Ala Let! 




345 








Gly 


Tyr 


Pro Lys 


Asp 


lie Ser Asn Tyr 


360 








365 




He 


Asp Ala 


Ala 


Val Phe Tyr Arg 








380 




Asn 


Asp 


Gin Phe 


Trp 


Arg Tyr Asp Asn 






'395 




400 


Gly 


Tyr 


Pro Lys 


Ser' 


He Ser Gly Ala 




410 




415 


Val 


Asp 


Ala Val 


Phe 


Gin Gin Glu His 




425 






430 


Pro 


Arg 


Tyr Tyr Ala Phe Asp Leu He 


440 








445 


Ala 


Arg 


Gly Asn Lys Trp Leu Asn Cys 
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290 295 300 

Arg Arg His Pro Gin Leu Gin 
305 310 
Phe Trp Pro Ser Leu Pro Thr 
325 

Asp Arg Asp Leu He Phe Leu 
340 

Ser Gly Tyr Asp He Leu Gin 
355 

Gly Phe Pro Ser Ser Val Gin 
370 375 
Ser Lys Thr Tyr Phe Phe Val 
385 390 
Gin Arg Gin Phe Met Glu Pro 
405 

Phe Pro Gly He Glu Ser Lys 
420 

Phe Phe His Val Phe Ser Gly 
435 

Ala Gin Arg Val Thr Arg Val 

450 455 460 

Arg Tyr Gly 
465 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Leu Ser Trp Thr His 
15 

Glu Asp Asp Leu Ser 
30 

Arg Ser Tyr Tyr His 
45 

Ala Ala Ser Ser Met 

60 

Gly Leu Glu Val Thi 
80 

Lys Lys Pro Arg Cys 
95 

Pro Arg Thr Leu Lys 
110 

Asn Tyr Thr Pro Asj 

1^5 



Met 


His 


Pro 


Gly 


Val 


Leu 


Ala Ala 


Phe 


Leu 


Phe 


1 








5 








10 




Cys Arg 


Ala 


Leu 


Pro 


Leu 


Pro Ser Gly Gly Asp 








20 








25 






Glu 


Glu 


Asp 


Leu 


Gin 


Phe 


Ala Glu Arg Tyr 


Leu 






35 








40 








Pro 


Thr 


Asn 


Leu 


Ala Gly 


He Leu 


Lys 


Glu 


Asn 




50 










55 








Thr 


Glu 


Arg 


Leu 


Arg 


Glu 


Met Gin 


Ser 


Phe 


Phe 


65 








70 








75 


Gly 


Lys 


Leu 


Asp 


Asp Asn 


Thr Leu 


Asp 


val 


Met 






85 








90 




Gly Val 


Pro 


Asp 


Val 


Gly 


Glu Tyr 


Asn 


Val 


Phe 








100 








105 






Trp 


Ser 


Lys 


Met 


Asn 


Leu 


Thr Tyr 


Arg 


He 


Val 




115 








120 








Met 


Thr 


His 


Ser 


Glu 


Val 


Glu .Lys 


Ala 


Phe 


Lys 
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130 






135 






140 








Trp 


Ser 


Asp Val 


Thr 


Pro Leu Asn Phe 


Thr 


Arg 


Leu 


His 


Asp Gly He 


145 








150 




155 








160 


Ala 


Asp 


lie Met 


lie 


Ser Phe Gly He 


Lys 


Glu 


His 


Gly 


Asp 


Phe Tyr 








165 




170 










175 


Pro 


Phe Asp Gly Pro Ser Gly Leu Leu 


Ala 


His 


Ala 


Phe 


Pro 


Pro Gly 






180 




185 










190 




Pro Asn Tyr Gly Gly Asp Ala His Phe 


Asp Asp Asp 


Glu 


Thr Trp Thr 






195 




200 








205 






Ser Ser Ser Lys Gly Tyr Asn Leu Phe 


Leu 


Val 


Ala 


Ala 


His 


Glu Phe 




210 






215 






220 








Gly His 


Ser Leu Gly Leu Asp His Ser 


Lys Asp 


Pro 


Gly 


Ala 


Leu Met 


225 








230 




235 








240 


Phe 


Pro 


lie Tyr Thr Tyr Thr Gly Lys 


Ser 


His 


Phe 


Met 


Leu 


Pro Asp 








245 




250 










255 


Asp Asp Val Gin Gly lie Gin Ser Leu 


Tyr Gly Pro 


Gly 


Asp Glu Asp 






260 




265 










270 




Pro 


Asn 


Pro Lys 


His 


Pro Lys Thr Pro 


Asp 


Lys 


Cys 


Asp 


Pro 


Ser Leu 






275 




280 








285 






Ser Leu Asp Ala 


He 


Thr Ser Leu Arg 


Gly Glu Thr 


Met 


He 


Phe Lys 




290 






295 






300 








Asp 


Arg 


Phe Phe 


Trp 


Arg Leu His Pro 


Gin 


Gin 


Val 


Asp 


Ala 


Glu Leu 


305 








310 




315 








320 


Phe 


Leu 


Thr Lys 


Ser 


Phe Trp Pro Glu 


Leu 


Pro 


Asn 


Arg 


He 


Asp Ala 








325 




330 










335 


Ala 


Tyr 


Glu His 


Pro 


Ser. His Asp Leu 


He 


Phe 


He 


Phe 


Arg Gly Arg 






340 




345 










350 




Lys 


Phe 


Trp Ala 


Leu 


Asn Gly Tyr Asp 


He 


Leu 


Glu 


Gly 


Tyr 


Pro Lys 






355 




360 








365 






Lys 


lie 


Ser Glu 


Leu 


Gly Leu Pro Lys 


Glu 


Val 


Lys 


Lys 


He 


Ser Ala 




370 






375 






380 








Ala 


Val 


His Phe Glu Asp Thr Gly Lys 


Thr 


Leu 


Leu 


Phe 


Ser 


Gly Asn 


385 








390 




395 








400 


Gin 


Val 


Trp Arg 


Tyr Asp Asp Thr Asn 


His 


He 


Met 


Asp 


Lys 


Asp Tyr 








405 




410 










415 


Pro 


Arg 


Leu lie 


Glu Glu Asp Phe Pro Gly 


He Gly 


Asp 


Lys 


Val Asp 






420 




425 










430 




Ala 


Val 


Tyr Glu Lys Asn Gly Tyr lie 


Tyr 


Phe 


Phe 


Asn 


Gly 


Pro He 






435 




440 








445 






Gin 


Phe Glu Tyr Ser 


He Trp Ser Asn Arg 


He 


Val 


Arg 


Val 


Met Pro 




450 






455 






460 








Ala 


Asn 


Ser He 


Leu 


Trp Cys 















465 470 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Phe Leu Leu lie Leu Leu Leu Gin Ala Thr Ala Ser Gly Ala 

15 10 15 

Leu Pro Leu Asn Ser Ser Thr Ser Leu Glu Lys Asn Asn Val Leu Phe 

20 25 . 30 

Gly Glu Arg Tyr Leu Glu Lys Phe Tyr Gly Leu Glu lie Asn Lys Leu 

35 40 45 

Pro Val Thr Lys Met Lys Tyr Ser Gly Asn Leu Met Lys Glu Lys He 

50 55 60 

Gin Glu Met Gin His Phe Leu Gly Leu Lys Val Thr Gly Gin Leu Asp 
€5 70 75 80 

Thr Ser Thr Leu Glu Met Met His Ala Pro Arg Cys Gly Val Pro Asp 

85 90 95 . 

Val His His Phe Arg Glu Met Pro Gly Gly Pro Val Trp Arg Lys His 

100 105 110 

Tyr He Thr Tyr Arg He Asn Asn Tyr Thr Pro Asp Met Asn Arg Glu 

115 120 125 

Asp Val Asp Tyr Ala He Arg Lys Ala Phe Gin Val Trp Ser Asn Val 

130 135 140 

Thr Pro Leu Lys Phe Ser Lys He Asn Thr Gly Met Ala Asp He Leu 
145 150 155 160 

Val Val Phe Ala Arg Gly Ala His Gly Asp Phe His Ala' Phe Asp Gly 

165 170 175 

Lya Gly Gly He Leu Ala His Ala Phe Gly Pro Gly Ser Gly He Gly 

180 185 190 

Gly Asp Ala His Phe Asp Glu Asp Glu Phe Trp Thr Thr His Ser Gly 

195 200 205 

Gly Thr Asn Leu Phe Leu Thr Ala Val His Glu He Gly His Ser Leu 

210 215 220 

Gly Leu Gly His Ser Ser Asp Pro Lys Ala Val Met Phe Pro Thr Tyr 
225 230 235 240 

Lys Tyr Val Asp He Asn Thr Phe Arg Leu Ser Ala Asp Asp He Arg 

245 250 255 

Gly He Gin Ser Leu Tyr Gly Asp Pro Lys Glu Asn Gin Arg Leu Pro, 

260 265 270 

Asn Pro Asp Asn Ser Glu Pro Ala Leu Cys Asp Pro Asn Leu Ser Phe 

275 2B0 285 

Asp Ala Val Thr Thr Val Gly Asn Lys He Phe Phe Phe Lys Asp Arg 

290 295 300 

Phe Phe Trp Leu Lys Val Ser Glu Arg Pro Lys Thr Ser Val Asn Leu 
305 310 315 320 

He Ser Ser Leu Trp Pro Thr Leu Pro Ser Gly He Glu Ala Ala Tyr 

325 330 335 

Glu He Glu Ala Arg Asn Gin Val Phe Leu Phe Lys Asp Asp Lys Tyr 

340 345 350 

Trp Leu He Ser Asn Leu Arg Pro Glu Pro Asn Tyr Pro Lys Ser He 

- 355 360 365 

His Ser Phe Gly Phe Pro Asn Phe Val Lys Lys He Asp Ala Ala Val 

370 375 380 

Phe Asn Pro Arg Phe Tyr Arg Thr Tyr Phe Phe Val Asp Asn Gin Tyr 
385 390 . 395 400 

Trp Arg Tyr Asp Glu Arg Arg Gin Met Met Asp Pro Gly Tyr Pro Lys 
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405 410 415 

^ Leu . lie Thr Lys Asn Phe Gin Gly lie Gly Pro Lys He Asp Ala Val 
420 425 430 

Phe Tyr Ser Lys Asn Lys Tyr Tyr Tyr Phe Phe Gin Gly Ser Asn Gin 

435 440 445 

Phe Glu Tyr Asp Phe Leu Leu Gin Arg He Thr Lys Thr Leu Lys Ser 

450 455 460 

Asn Ser Trp Phe Gly Cys 
465 .470 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Met 


Arg 


Leu 


Thr 


Val 


Leu 


Cys 


Ala Val 


Cys 


Leu 


Leu Pro Gly Ser Leu 


1 








5 








10 




15 


Ala 


Leu 


Pro 


Leu 


Pro 


Gin 


Glu 


Ala Gly 


Gly 


Met 


Ser Glu Leu Gin Trp 








20 








25 






30 


Glu 


Gin 


Ala 


Gin 


Asp 


Tyr 


Leu 


Lys Arg 


Phe 


Tyr 


Leu Tyr Asp Ser Glu 






35 










40 






45 


Thr 


Lys 


Asn 


Ala 


Asn 


Ser 


Leu 


Glu Ala 


Lys 


Leu 


Lys Glu Met Gin Lys 




50 










55 








60 . 


Phe 


Phe 


Gly 


Leu 


Pro 


He 


Thr 


Gly Met 


Leu 


Asn 


Ser Arg Val He Glu 


65 










70 








75 


. 80 


He 


Met 


Gin 


Lys 


Pro 


Arg 


Cys 


Gly Val 


Pro 


Asp 


Val Ala Glu Tyr Ser 










85 








90 




95 


Leu 


Phe 


Pro 


Asn 


Ser 


Pro 


Lys 


Trp Thr 


Ser 


Lys 


Val Val Thr Tyr Arg 








100 








105 






110 


He 


Val 


Ser 


Tyr 


Thr 


Arg 


Asp 


Leu Pro 


His 


He 


Thr Val Asp Arg Leu 






115 










120 , 






125 


Val 


Ser 


Lys 


Ala 


Leu 


Asn 


Met 


Trp Gly 


Lys 


Glu 


He Pro Leu His Phe 




130 










135 








140 


Arg 


Lys 


Val 


Val 


Trp 


Gly 


Thr 


Ala Asp 


.He 


Met 


He Gly Phe Ala Arg 


145 










150 








155 


160 


Gly 


Ala 


His 


Gly 


Asp 


Ser 


Tyr 


Pro Phe 


Asp 


Gly 


Pro Gly Asn Thr Leu 










165 








170 




175 


Ala 


His 


Ala 


Phe 


Ala 


Pro 


Gly 


Thr Gly 


Leu 


Gly 


Gly Asp Ala His Phe 








180 








185 






190 


Asp 


Glu 


Asp 


Glu 


Arg 


Trp 


Thr 


Asp Gly 


Ser 


Ser 


Leu Gly He Asn Phe 






195 










200 






205 


Leu 


Tyr 


Ala 


Ala 


Thr 


His 


Glu 


Leu Gly 


His 


Ser 


Leu Gly Met Gly His 




210 










215 








220 


Ser 


Ser 


Asp 


Pro 


Asn 


Ala 


Val 


Met Tyr 


Pro 


Thr 


Tyr Gly Asn Gly Asp 


225 










230 








235 


240 


Pro 


Gin 


Asn 


Phe 


Lys 


Leu 


Ser 


Gin Asp 


Asp 


He 


Lys g:.v lie Gin Lys 
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245 250 255 

Leu Tyr Gly Lys Arg Ser Asn Ser -Arg Lys Lys 
260 265 

(2) INFORMATION FOR SEQ ID NO; 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



Met 


Ser 


Pro 


Ala 


Pro 


Arg 


Pro 


Ser 


Arg 


Cys 


Leu 


Leu 


Leu Pro Leu 


Leu 


1 








5 










10 






15 




Thr 


Leu 


Gly 


Thr 


Ala 


Leu 


Ala 


Ser 


Leu 


Gly 


Ser 


Ala 


Gin Ser Ser 


Ser 








20 










25 








30 




Phe 


Ser 


Pro 


Glu 


Ala 


Trp 


Leu 


Gin 


Gin 


Tyr 


Gly 


Tyr 


Leu Pro Pro 


Gly 






35 










40 










45 




Asp 


Leii 


Arg 


Thr 


His 


Thr 


Gin 


Arg 


Ser 


Pro 


Gin 


Ser 


Leu Ser Ala 


Ala 




50 










55 










60 






He 


Ala 


Ala 


Met 


Gin 


Lys 


Phe 


Tyr 


Gly 


Leu 


Gin 


Val 


Thr Gly Lys 


Ala 


65 










70 










75 






60 


Asp 


Ala 


Asp 


Thr 


Met 


Lys 


Ala 


Met 


Arg 


Arg 


Pro 


Arg 


■Cys Gly Val 


Pro 










85 










90 






95 




Asp 


Lys 


Phe 


Gly 


Ala 


Glu 


He 


Lys 


Ala 


Asn 


Val 


Arg 


Arg Lys Arg 


Tyr 




100. 










105 








110 




Ala 


He 


Gin 


Gly 


Leu 


Lys 


Trp 


Gin 


His 


Asn 


Glu 


He 


Thr Phe Cys 


He 






115 










120 










125 




Gin 


Asn 


Tyr 


Thr 


Pro 


Lys 


Val 


Gly 


Glu 


Tyr 


Ala 


Thr 


Tyr Glu Ala 


He 




130 










135 










140 






Arg 


Lys 


Ala 


Phe 


Arg 


Val 


Trp 


Glu 


Ser 


Ala 


Thr 


Pro 


Leu Arg .Phe 


Arg 


145 










150 










155 






160 


Glu 


Val 


Pro 


Tyr 


Ala 


Tyr 


He 


Arg 


Glu 


Gly 


His 


Glu 


Lys Gin Ala 


Asp 










165 










170 






175 




He 


Met 


He 


Phe 


Phe 


Ala 


Glu 


Gly 


Phe 


His 


Gly 


Asp 


Ser Thr Pro 


.Phe 








180 










185 








190 




Asp 


Gly 


Glu 


Gly 


Gly 


Phe 


Leu 


Ala 


His 


Ala 


Tyr 


Phe 


Pro Gly Pro 


Asn 






195 










200 










205 




He 


Gly 


Gly 


Asp 


Thr 


His 


Phe 


Asp 


Ser 


Ala 


Glu 


Pro 


Trp Thr Val 


Arg 




210 










215 










220 






Asn 


Glu 


Asp 


Leu 


Asn 


Gly 


Asn 


Asp 


He 


Phe 


Leu 


Val 


Ala Val His 


Glu 


225 










230 










235 






240 


Leu 


Gly 


His 


Ala 


Leu 


Gly 


Leu 


Glu 


His 


Ser 


Ser 


Asp 


Pro Ser Ala 


He 










245 










250 






255 




Met 


Ala 


Pro 


Phe 


Tyr 


Gin 


Trp 


Met 


Asp 


Thr 


Glu 


Asn 


Phe Val Leu 


Pro 








260 










265 








270 




Asp 


Asp 


Asp 


Arg 


Arg 


Gly 


He 


Gin 


Gin 


Leu 


-Tyr 


Gly 


Gly Glu Ser 


Gly 






275 










280 










:85 




Phe 


Pro 


Thr 


Lys 


Met 


Pro 


Pro 


Gin 


Pro 


Arg 


Thr 


Thr 


Ser Arg Pro 


Ser 



wo 98/40475 

PCT/US98/04694 

80 



NOT FURNISHED UPON FILING 



NO PRESENT ADO(A) EN EL MOMENTO DE LA PRESENTACION 



NON SOUMIS(E) AU MOMENT DU DEPOT 
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20 


25 




30 




Leu 


Leu 


Val 


Leu Leu Gly Cys 


Leu Gly Leu Gly 


Val Ala 


Ala 


Glu Asp 






35 




40 


45 






Ala 


Glu 


Val 


His Ala Glu Asn 


Trp Leu Arg Leu 


Tyr Gly Tyr 


Leu Pro 




50 




55 




60 






Gin 


Pro 


Ser 


Arg His Met Ser 


Thr Met Arg Ser 


Ala Gin 


He 


Leu Ala 


65 






70 


75 






80 


Ser 


Ala 


Leu 


Ala Glu Met Gin 


Arg Phe Tyr Gly 


He Pro 


Val 


Thr Gly 








85 


90 






95 


Val 


Leu 


Asp 


Glu Glu Thr Lys 


Glu Trp Met Lys 


Arg Pro Arg 


Cys Gly 








100 


105 




110 




Val 


Pro 


Asp 


Gin Phe Gly Val 


Arg Val Lys Ala 


Asn Leu Arg 


Arg Arg 






115 




120 


125 






Arg 


Lys 


Arg 


Tyr. Ala Leu Thr 


Gly Arg Lys Trp 


Asn Asn 


His 


His Leu 




130 




135 




140 






Thr 


Phe 


Ser 


He Gin Asn Tyr 


Thr Glu Lys Leu 


Gly Trp Tyr 


His Ser 


145 






150 


155 






. 160 


Mec 


Glu 


Ala 


Val Arg Arg Ala 


Phe Arg Val Trp 


Glu Gin Ala 


Thr Pro 








165 


170 






175 


Leu 


Val 


Phe 


Gin Glu Val Pro 


Tyr Glu Asp He 


Arg Leu 


Arg 


Arg Gin 








130 


185 




190 




Lys 


Glu 


Ala 


Asp He Met Val 


Leu Phe Ala Ser 


Gly Phe 


His 


Gly Asp 






195 




'200 


205 






Ser 


Ser 


Pro 


Phe Asp Gly Thr 


Gly Gly Phe Leu 


Ala His 


Ala 


Tyr Phe 




210 




215 




220 






Pro 


Gly 


Pro 


Gly Leu Gly Gly 


Asp Thr His Phe 


Asp Ala Asp 


Glu Pro 


225 






230 


235 






240 


Trp 


Thr 


Phe 


Ser Ser Thr Asp 


Leu His Gly Asn 


Asn Leu 


Phe 


Leu Val 








245 


250 






255 


Ala 


Val 


His 


Glu Leu Gly His 


Ala Leu Gly Leu 


Glu His 


Ser 


Ser Asn 








260 


265 




270 




Pro 


Asn 


Ala 


He Met Ala Pro 


Phe . Tyr Gin Trp 


Lys Asp 


Val 


Asp Asn 






275 




280 


285 






Phe 


Lys 


Leu 


Pro Glu Asp Asp 


Leu Arg Gly He 


Gin Gin 


Leu 


Tyr Gly 




290 




295 




300 






Thr 


Pro 


Asp 


Gly Gin Pro Gin 


Pro Thr Gin Pro 


Leu Pro 


Thr 


Val Thr 


305 






310 


315 






320 


Pro 


Arg 


Arg 


Pro Gly Arg Pro 


Asp His Arg Pro 


Pro Arg 


Pro 


Pro Gin 








325 


330 






335 


Pro 


Pro 


Pro 


Pro Gly Gly Lys 


Pro Glu Arg Pro 


Pro Lys 


Pro 


Gly Pro 








340 


345 




350 




Pro 


Val 


Gin 


Pro Arg Ala Thr 


Glu Arg Pro Asp 


Gin Tyr 


Gly 


Pro Asn 






355 




360 


365 






He 


Cys 


Asp 


Gly Asp Phe Asp 


Thr Val Ala Met 


Leu Arg Gly 


Glu Met 




370 




375 




380 






Phe 


Val 


Phe 


Lys Gly Arg Trp 


Phe Trp Arg Val 


Arg His 


Asn 


Arg Val 


385 






390 


395 






400 


Leu 


Asp 


Asn 


Tyr Pro Met Pro 


He- Gly His Phe 


Trp Arg 


Gly 


Leu Pro 








405 


410 






415 


Gly 


Asp 


He 


Ser Ala Ala Tyr 


Glu Arg Gin Asp 


Gly Arg 


Phe 


Val Phe 








420 


425 




430 




Phe 


Lys 


Gly 


Asp Arg Tyr Trp 


Leu Phe Arg Glu 


Ala Asn 


Leu 


Glu Pro 






435 




440 


.445 
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- Gly Tyr Pro, Gin Pro Leu Thr Ser Tyr Gly Leu Gly He Pro Tyr Aso 
450 455 460 

Arg He Asp Thr Ala He Trp Trp Glu Pro Thr Gly His Thr Phe Phe 
470 475 480 

Phe Gin Glu Asp Arg Tyr Trp Arg Phe Asn Glu Glu Thr Gin Arg Gly 

485 490 495 

Asp Pro Gly Tyr Pro Lys Pro He Ser Val Trp Gin Gly He Pro Ala * 

500 ■ 505 . 510 

Ser Pro Lys Gly Ala Phe Leu Ser Asn Asp Ala Ala' Tyr Thr Tyr Phe 

515 520 525 

Tyr Lys Gly Thr Lys Tyr Trp Lys Phe Asp Asn Glu Arg Leu Arg Met 

530 535 540 

Glu Pro Gly Tyr Pro Lys Ser lie Leu Arg Asp Phe Met Gly Cys Gin 
545 550 555 ^ 560 

Glu His Val Glu Pro Gly Pro Arg Trp Pro Asp Val Ala Arg Pro Pro 

565 570 575 

Phe Asn Pro His Gly Gly Ala Glu Pro Gly Ala Asp Ser Ala Glu Gly 

580 585 590 

Asp Val Gly Asp Gly Asp Gly Asp Phe Gly Ala Gly Val Asn Lya Asp 

595 600 . 605 . 

Gly Gly Ser Arg Val Val Val Gin Met Glu Glu Val Ala Arg Thr Val 

610 615 620 

Asn Val Val Met Val Leu Val Pro Leu Leu Leu Leu Leu Cys Val Leu 
^25 630 635 640 

Gly Leu Thr Tyr Ala Leu Val Gin Met Gin Arg Lys Gly Ala Pro Arg 

645 650 655 

Val Leu Leu Tyr Cys Lys Arg Ser Leu Gin Glu Trp Val 
660 665 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:30: 

Met He Leu Leu Thr_Phe Ser Thr Gly Arg Arg Leu Asp Phe Val His 

1 5 . 10 15 

His Ser Gly Val Phe Phe Leu Gin Thr Leu Leu Trp He Leu Cys Ala 

20 25 . 30 

Thr Val Cys Gly Thr Glu Gin Tyr Phe Asn Val Glu Val Trp Leu Gin 

35 40 ■ 45 

Lys Tyr Gly Tyr Leu Pro Pro Thr Ser Pro Arg Met Ser Val Val Arg 

SO 55 60 ^ 

Ser Ala Glu Thr Met Gin Ser Ala Leu Ala Ala Met Gin Gin Phe Tyr 
^5 .70 75 BO 

Gly He Asn Met Thr Gly Lys Val Asp Arg Asn Thr He Asp Trp Met 
85 90 95 
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Lys Lys Pro Arg Cys Gly Val Pro Asp Gin Thr Arg Gly Ser Ser Lys 

100 105 110 

Phe His He Arg Arg Lys Arg Tyr Ala ' Leu Thr Gly Gin Lys Trp Gin 

115 120 125 

His Lys His "He Thr Tyr Ser He Lys Asn Val Thr Pro Lys Val Gly 

130 135 140 

Asp Pro Glu Thr Arg Lys Ala He Arg Arg Ala Phe Asp Val Trp Gin 
145 150 155 160 

Asn Val Thr Pro Leu Thr Phe Glu Glu Val Pro Tyr Ser Glu Leu Glu 

165 170 175 

Asn Gly Lys Arg Asp Val Asp He Pro He He Phe Ala Ser Gly Phe 

180 185 190 

His Gly Asp Ser Ser Pro Phe Asp Gly Glu Gly Gly Phe Leu Ala His 

195 200 205 

Ala Tyr Phe Pro Gly Pro Gly He Gly Gly Asp Thr His Phe Asp Ser 

210 215 220 

Asp Glu Pro Trp Thr Leu Gly Asn Pro Asn His Asp Gly Asn Asp Leu 
225 230 235 240 

Phe Leu Val Ala Val His Glu Leu Gly His Ala Leu Gly Leu Glu His 

245 . 250 255 

Ser Asn Asp Pro Thr Ala He Met Ala Pro Phe Tyr Gin Tyr Met Glu 

260 265 270 

Gin Thr Leu Gin Leu Pro Asn Asp Asp Tyr Arg His Gin Arg Tyr Met 

275 280 265 

Ser Pro Asp Lys He Pro Pro Pro Thr Arg Pro Leu Pro Thr Val Pro 

290 295 300 

Pro His Arg Ser He Pro Pro Ala Asp Pro Arg Lys Asn Asp Arg Pro 
305 310 315 320 

Lys Pro Pro Arg Pro Pro Thr Gly Arg Pro Ser Tyr Pro Gly Ala Lys 

325 330 335 

Pro Asn He Cys Asp Gly Asn Phe Asn Thr Leu Ala He Leu Arg Arg 

340 345 3S0 

Glu -Met Phe Val Phe Lys Asp Gin Trp Phe Trp Arg Val Arg Asn Asn 

355 360 365 

Arg Val Met Asp Gly Tyr Pro Met Gin He Thr Tyr Phe Trp Arg Gly 

370 375 380 

Leu Pro Pro Ser He Asp Ala Val Tyr Glu Asn Ser Asp Gly Asn Phe 
385 390 395 400 

Val Phe Phe Lys Gly Asn Lys Tyr Trp Val Phe Lys Asp Thr Thr Leu 

405 410 415 

Gin Pro Gly Tyr Pro His Asp Leu He Thr Leu Gly Ser Gly He Pro 

420 425 430 

Pro His Gly He Asp Ser Ala He Trp Trp Glu Asp Val Gly Lys Thr 

435 440 445 

Tyr Phe Phe Lys Gly Asp Arg Tyr Trp Arg Tyr Ser Glu Glu Met Lys 

450 455 460 

Thr Mec Asp Pro Gly Tyr Pro Lys Pro He Thr Val Trp Lys Gly He 
465 470 475 480 

Pro Glu Ser Pro Gin Gly Ala Phe Val His Lys Glu Asn Gly Phe Thr 

485 490 495 

Tyr Phe Tyr Lys Glu Gly Val Leu Glu He Gin Thr Thr Arg Tyr Ser 

500 505 510 

Arg Leu Glu Pro Gly His Pro Arg Ser He Leu Lys Asp Leu Ser Gly 
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515 520 525 

Cys Asp Gly Pro Thr Asp Arg Val Lys Glu Gly'His Ser Pro Pro Asp 

530 535 540 

Asp Val Asp He Val He Lys Leu Asp Asn Thr Ala Ser Thr Val Lys 
545 550 555 560 

Ala He Ala He Val He Pro Cys He Leu Ala Leu Cys Leu Leu Val 

565 570 . S7S 

Leu Val Tyr Thr Val Phe Gin Phe Lys Arg Lys Gly Thr Pro Arg Hie 

580 585 590 

He Leu Tyr Cys Lys Arg Ser Met Gin Glu Trp Val 
595 600 

(2) INFORMATION FOR SEC ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Met 


Ala 


Pro 


Ala 


Ala 


Trp 


Leu 


Arg 


Ser 


Ala Ala 


Ala Arg 


Ala 


Leu 


Leu 


1 








5 










10 






15 




Pro 


Pro 


Met 


Leu 


Leu 


Leu 


Leu 


Leu 


Gin 


Pro. Pro 


Pro Leu 


Leu Ala Arg 








20 










25 






30 






Ala 


Leu 


Pro 


Pro 


Asp 


Val 


His 


His 


Leu 


His Ala 


Glu Arg 


Arg 


Gly 


Pro 






35 










40 






45 








Gin 


Pro 


Trp 


His 


Ala 


Ala 


Leu 


Pro 


Ser 


Ser Pro 


Ala Pro 


Ala 


Pro 


Ala 




50 










55 








60 








Thr 


Gin 


Glu 


Ala 


Pro 


Arg 


Pro 


Ala 


Ser 


iser Leu 


Arg Pro 


Pro 


Arg 


Cys 


65 










70 








75 








60 


Gly 


Val 


Pro 


Asp 


Pro 


Ser 


Asp 


Gly 


Leu 


Ser Ala 


Arg Asn 


Arg 


Gin 


Lys 










85 










90 






95 




Arg 


Phe 


Val 


Leu 


Ser 


Gly 


Gly 


Arg 


Trp 


Glu Lys 


Thr Asp 


Leu 


Thr 


Tyr 








100 










105 






110 






Arg 


He 


Leu 


Arg 


Phe 


Pro 


Trp 


Gin 


Leu 


Val Gin 


Glu Gin 


Val 


Arg 


Gin 






115 










120 






125 








Thr 


Met 


Ala 


Glu 


Ala 


Leu 


Lys 


Val 


Trp 


Ser Asp 


Val .Thr 


Pro 


Leu 


Thr 




130 










135 








140 








Phe 


Thr 


Glu 


Val 


His 


Glu 


Gly 


Arg 


Ala 


Asp He 


Met He 


Asp 


Phe 


Ala 


145 










150 








155 








160 


Arg 


Tyr 


Trp 


Asp 


Gly 


Asp 


Asp 


Leu 


Pro 


Phe Asp 


Gly Pro 


Gly 


Gly 


He 










165 










170 . 






175 




Leu 


Ala 


His 


Ala 


Phe 


Phe 


Pro 


Lys 


Thr 


His Arg 


Glu Gly 


Asp 


Val 


His 






180 










185 






190 






Phe 


Asp 


Tyr 


Asp 


Glu 


Thr 


Trp 


Thr 


He 


Gly Asp 


Asp Gin 


Gly Thr Asp 






195 










200 






205 








Leu 


Leu 


Gin 


Vai 


Ala 


Ala 


His 


Glu 


Phe 


Gly His 


Val Leu 


Gly 


Leu 


Gin 




:^i0 










215 








220 








His 


Thr 


Thr 


Ala 


Ala 


Lys 


Ala 


Leu 


Met 


Ser Ala 


Phe Tyr 


Thr 


Phe 


Arg 
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225 










230 








235 240 


Tyr 


Pro 


Leu 


Ser 


Leu 


Ser 


Pro 


Asp 


Asp 


Cys Arg Gly Val Gin His Leu 










245 










250 255 


Tyr 


Gly 


Gin 


Pro 


Trp 


Pro 


Thr 


Val 


Thr 


Ser Arg Thr, Pro Ala Leu Gly 






260 










265 


270 


Pro 


Gin 


Ala 


Gly 


He 


Asp 


Thr 


Asn 


Glu 


He Ala Pro Leu Glu Pro Asp 






275 










260 




285 


Ala 


Pro 


Pro 


Asp 


Ala 


Cys 


Glu 


Ala 


Ser 


Phe Asp Ala Val Ser Thr lie 




290 










295 






300 


Arg 


Gly GIu 


Leu 


Phe 


Phe 


Phe 


Lys 


Ala 


Gly Phe Val Trp Arg Leu Arg 


305 










310 








315 320 


Gly 


Gly Gin 


Leu 


Gin 


Pro 


Gly 


Tyr 


Pro 


Ala Leu Ala Ser Arg His Trp 








325 










330 335 


Gin 


Gly 


Leu 


Pro 


Ser 


Pro 


Val 


Asp 


Ala 


Ala Phe Glu Asp Ala Gin Gly 




340 










345 


350 


His 


lie 


Trp 


Phe 


Phe 


Gin 


Gly 


Ala 


Gin 


Tyr Trp Val Tyr Asp Gly Glu 






355 










360 




365 


Lys 


Pro 


Val 


Leu 


Gly Pro 


Ala 


Pro 


Leu 


Thr Glu Leu Gly Leu Val Arg 


370 










375 






380 


Phe 


Pro 


Val 


His 


Ala 


Ala 


Leu 


Val 


Trp 


Gly Pro Glu Lys Asn Lys He 


385 










390 








395 400 


Tyr 


Phe 


Phe 


Arg 


Gly Arg 


Asp 


Tyr 


Trp 


Arg Phe His Pro Ser Thr Arg 








405 










410 415 


Arg 


val 


Asp 


Ser 


Pro 


Val 


Pro 


Arg 


Arg 


Ala Thr Asp Trp Arg Gly Val 




420 










425 


430 


Pro 


Ser 


Glu 


He 


Asp 


Ala 


Ala 


Phe 


Gin 


Asp Ala Asp Gly Tyr Ala Tyr 






435 










4 40 




445 


Phe 


Leu 


Arg 


Gly 


Arg 


Leu 


Tyr 


Trp 


Lys 


Phe Asp Pro Val Lys Val Lys 




450 










455 






460 


Ala 


Leu 


Glu 


Gly 


Phe 


Pro 


Arg 


Leu 


Val 


Gly Pro Asp Phe Phe Gly Cys 


465 








470 








475 480 


Ala 


Glu 


Pro 


Ala 


Asn 


Thr 


Phe 


Leu 







485 

(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Gin Gin Phe Gly Gly Leu Glu Ala Thr Gly He Asp Glu Ala Thr 

1 5 10 15 

Leu Ala Leu Met Lys Thr Pro Arg Cys Ser Leu Pro Asp Leu Pro Val 

20 25 30 

Leu Thr Gin Ala Arg Arg Arg Arg Gin Ala Pro Ala Pro Thr Lys Trp 

35 40 45 

Asn Lys Arg Asn Leu Ser Trp Arg Val Arg Tar Phe Pro Arg Asp Ser 
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50 55 , 60 

Pro Leu Gly His Asp Thr Val Arg Ala Leu Met Tyr Tyr Ala Leu.Lys 
65 70 75 80 

Val Trp Ser Asp He Ala Pro Leu Asn Phe His Glu Val Ala Gly Ser 

85 90 95 

Thr Ala Asp He Gin He Asp Phe Ser Lys Ala Asp His Asn Asp Gly 

100 .105 110 
Tyr Pro Phe Asp Ala Arg Arg His Arg Ala His Ala Phe Phe Pro Gly 

. 115 120 125 

His His His Thr Ala Gly Tyr Thr His Phe Asn Asp Asp Glu Ala Trp 

130 135 140 

Thr Phe Arg Ser Ser Asp Ala His Gly Met Asp Leu Phe Ala Val Ala 
145 150 155 160 

Val His Glu Phe Gly His Ala He Gly Leu Ser His Val Ala Ala Ala 

165 170 175 

His Ser He Met Arg Pro Tyr Tyr Gin Gly Pro Val Gly Asp Pro Leu 

180 185 190 

Arg Tyr Gly Leu Pro Tyr Glu Asp Lys Val Arg Val Trp Gin Leu Tyr 

195 200 205 

ciy val Arg Glu Ser Val Ser Pro Thr Ala Gin Pro Glu Glu Pro Pro 

210 215 220 

Leu Leu Pro Glu Pro Pro Asp Asn Arg Ser Ser Ala Pro Pro Arg Lys 
225 230 235 240 

Asp Val Pro His Arg Cys. Ser Thr His Phe Asp Ala Val Ala Gin He 

245 250 255 

Arg Gly Glu Ala Phe Phe Phe Lys Gly Lys Tyr Phe Trp Arg Leu Thr 

260 265 270 

Arg Asp Arg His Leu Val Ser Leu Gin Pro Ala Gin Met His Arg Phe 

275 280 285 

Trp Arg Gly Leu Pro Leu His Leu Asp Ser Val Asp Ala Val Tyr Glu 

290 295 300 

Arg Thr Ser Asp His Lys He Val Phe Phe Lys Gly Asp Arg Tyr Trp 
305 310 315 320 

Val Phe Lys Asp Asn Asn Val Glu Glu Gly Tyr Pro Arg Pro Val Ser 

325 330 335 

Asp Phe Ser Leu Pro Pro Gly Gly He Asp Ala Ala Phe Ser Trp Ala 

340 345 350 

His Asn Asp Arg Thr Tyr Phe Phe Lys Asp Gin Leu Tyr Trp Arg Tyr^ 

355 360 365 

Asp Asp His Thr Arg His Met Asp Pro Gly Tyr Pro Ala Gin Ser Pro 

370 375 380 

Leu Trp Arg Gly Val Pro Ser Thr Leu Asp. Asp Ala Met Arg Trp Ser 
385 390 395 400 

Asp Gly Ala Ser Tyr Phe Phe Arg Gly Gin Glu Tyr Trp Lys Val Leu 

405 410 415 

Asp Gly Glu Leu Glu Val Ala Pro Gly Tyr Pro Gin Ser Thr Ala Arg 

420 425 430 

Asp Trp Leu Val Cys Gly Asp Ser Gin Ala Asp Gly Ser Val Ala Ala' 

435 440 ' 445 

Gly Val Asp Ala Ala Glu Gly Pro Arg Ala Pro Pro Gly Gin His Asp 

450 455 '460 

Gin Ser Arg Ser Glu Asp Gly Tyr Glu Val Cys Ser Cys Thr Ser Gly 
465 470 -475 480 
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Ala Ser Ser Pro Pro Gly Ala Pro Gly Pro Leu Val Ala Ala Thr Met 

485 490 495 

Leu Leu Leu Leu Pro Pro Leu Ser Pro Gly Ala Leu Trp Thr Ala Ala 

500 505 510 

Gin Ala Leu Thr Leu 
515 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Arg Xaa Lys Arg 
1 
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What is claimed: 

1 . A purified polynucleotide or fragment thereof derived from human matrix 
metalloproteinase 19 (MMP19) gene wherein said purified polynucleotide is capable of 
selectively hybridizing to the nucleic acid of said MMP19 gene, and wherein said purified 
polynucleotide has at least 50% identity to SEQ ID NO: 1 and fragments, analogs or 
complements thereof. 

2 . The purified polynucleotide of Claim 1 wherein said purified polynucleotide is 
produced by recombinant techniques. . 

,3 . The purified polynucleotide of Claim 2 wherein said polynucleotide 
produced by recombinant techniques comprises a sequence of at least one epitope encoded 
byMMP19. 

4. The purified polynucleotide of Claim 3 wherein said epitope is selected 
from the group consisting of SEQ E) NO: 14, SEQ ED NO: 1 5, SEQ ID NO: 16, and SEQ 
ID NO: 17. 

5. A purified polynucleotide of MMP19 which encodes an MMP 19 protein 
comprising an amino acid sequence which has at least 40% identity to SEQ ID NO: 10. 

6. The purified polynucleotide of Claim 5 which encodes an MMP19 protein 
having SEQ ID NO: 10. 

7 . A recombinant expression system comprising a nucleic acid sequence that 
encodes an open reading frame derived from MMP19 which is opcrabiy linked to a control 
sequence compatible with a desired host and wherein said nucleic acid sequence is capable 
of selectively hybridizing to the nucleic acid of said MMP 19 gene, and wherein said 
purified polynucleotide has at least 50% identity to SEQ ID NO: J and fragments, analogs 
or complements thereof. 

8 . A host cell comprising the recombinant expression system of Claim 7. 

9 . An MMP 1 9 polypeptide wherein said polypeptide has at least 40% identity 
to SEQ ID NO: 10 and fragments thereof. 
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1 0. The polypeptide of Claim 9 which further possesses the ability to cleave 
SEQIDN0:13. 

11. The polypeptide of Claim 9 selected from the group consisting of SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, and SEQ ID NO: 17. 

12. A method of detecting the presence of a target polynucleotide of MMP 1 9 in 

a test sample, comprising: 

(a) contacting said test sample with at least one iMMP19 specific polynucleotide 

or complement thereof; and 

(b ) detecting the presence of said target polynucleotide of MMP 19 in the test 

sample; 

wherein said MMP19 specific polynucleotide has at least 50% identity to polynucleotide 
SEQ ID NO: 1 and fragments, analogs or complements thereof. 

•13. The method of Claim 14 wherein said target polynucleotide of MMP 1 9 is 
attached to a solid phase prior to performing step (a). 

14. A method for detecting mRNA of MMP 1 9 in a test sample, comprising: 

(a) contacting said test sample with at least one primer; 

(b) performing reverse transcription in order to produce cDNA; 

(c) amplifying said cDNA obtained from step (b) by using other 
oligonucleotide primer(s) of MMP19 as sense and antisense primer(s) in a first-stage 
amplification to obtain MMPl 9 amplicon; 

(d) detecting the presence of said MMP 1 9. amplicon in the test sample; 
wherein said oligonucleotide primers of MMPl 9 have at least 50% identity to SEQ ID NO: 
1 and fragments, analogs or complements thereof. 

1 5. A method of detecting target MMP19 polynucleotide in a test sample 
suspected of containing said target, comprising: 

(a) contacting said target MMP19 poiynucleoiide with at lease on MMPl 9 
oligonucle:..cide as sense primer and with at least one MMP19 oligonucleotide as an 
antisense primer and amplifying same to obtain a first stage reaction product; 

(b) contacting said first stage reaction product with at least one other MMP 1 9 
oligonucleotide, with the proviso that the other MMP 19 oligonucleotide is located 3' to the 
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MMP 1 9 oligonucleotides utilized in step (a) and is complennentary to said first stage 
reaction product; and 

(c) detecting said target iMMP i 9 polynucleotide, 
wherein said MMP 19 oligonucleotides utilized in step (a) and step (b) have at least 50% 
identity to SEQ ID NO: 1 and fragments, analogs or complements thereof. 

16. The method of Claim 1 4 or Claim 1 5 wherein said test sample is reacted 
with a solid phase prior to performing step (a) or step (b) or step (c). 

1 7 . The method of Claim 1 4 or Claim 1 5 wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 

18. A method for identifying an inhibitor compound of MMP 19 comprising the 
steps of: 

(a) providing a reaction mixture coniprising 

(i) a substrate; 

(ii) an MMP 19 protein having at least 40% identity to SEQ ID NO: 10 or 
fragments thereof, said MMP19 protein further possessing the ability to cleave said 
substrate; and 

(iii) a compound of interest; 

(b) incubating the reaction mixture; and 

(c) determining the extent of cleavage .of the substrate in the reaction mixture. 

19. The method of Claim 1 8 wherein said substrate is SEQ ID NO: 1 3. 

20. The method of Claim 19 wherein said MMP 19 protein is SEQ ID NO: 10. 

21. An antibody which specifically binds to at least one epitope encoded by 
MMP 19, wherein said antibody is polyclonal or monoclonal and wherein said epitope 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 14, 
SEQ ID NO: 15, SEQ ID NO: 16, and SEQ ID NO: 17. 

22. An assay kit for determining the presence of MMP 19 antigen or antibody in 
a lest sample, comprising a container containing an antibody which specifically bins to 
MMP 19 antigen, wherein said antigen comprises at least one epitope of MMP 19 having at 
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least a sequence selected from the group consisting of SEQ ID NO: 14, SEQ ID NO: 15, 
SEQ ID NO: 16, and SEQ ID NO: 17. 

23. The kit of Claim 28 wherein said antibody is attached to a solid phase. 

24. A test kit useful for detecting MMP19 polynucleotide in a test sample, 
comprising a container containing at least one MMPi9 polynucleotide which is capable of 
selectively hybridizing to the nucleic acid of MMP19 gene and wherein said purified 
polynucleotide has at least 50% identity to SEQ ID NO: 1 an fragments, analogs or 
complements thereof. 
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FIG . 1 

xxxxxxxxxxxxxxxxxxxLxxLLLLxxxxxxxLxxxxxxxxxxxxxxxxxxxxxYLxxYYxLx 
xxxxxxxxxxxxxxxxxxxLxxMQxFFGLxVTGKLDxxTLExMxKPRCGVPDxxxxxxxxxVxx 

FxLxPxxPKWxxxxxTYRIxNYTPDLxxxxVDxAIxKAFxVWSxVTPLxFxxVxxxxxxxxxEG 
xADIMIxFAxxEHGDxxPFDGPGGxLAHAFxPGPGIGGDAHFDDDExWTxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxGxNLFLVA AHExGHSLGLxHS xDPxAL 

MYPxYxxFxDxxxFxLxxDDIxGIQxLYGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxPxxCDxxxxxxxxxxxxFDAIxxxRGExFFFKDRFFWRxx 
xxxxxxxxxxxLIxxFWPxLPxxIDAAYExxxxxxVFFFKGxxYWxYxGxxxxxGYPxxIxxxL 
GFPxxVxxIDAAxxxxxxxxKTYFFxxxxYWRYDExxxxMDPGYPKxIxxxFxGIxxxVDAVFx 
xxxxGFxYFFxGxxxYxFxxxxxxVxxxxxxxxxxxWLxCxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

FIG. 2 

1 50 

* ★ ★ . * . * 

CCTNCACTNGATTNAGCGATGCCATNAGAGCGTTTNAGTGGGTGTNCCAG 
XTXXSDAXRAF. XWVXQ 

51 100 

* * * * . 

CTACCTGTNAGCGGCGTGTTGGACCGCGCCACCCTGCGCCAGATGANTNG 
LPVSGVLDRA TLRQMXX 

' 101 . 150 

* * * . ■ * * 

TCCCCGGTGCGGGGTTACAGATACCAACAGTTATGNGGCCTGGGNTGAGA 
P R C G V T D T NSYXAWXE 

151 200 . 

***** 
GGATCAGTGANTTGTTTGCTAGACAACGGACCAAAATGAGGCGTAAGAAA 
RISXLF ARQRTKM R R K K 

201 250 
***** 
CGNTTTGCAAAGCAAGGTAACAAATGGTANAAGCAGCACCTNTTCTACCG 
R FAKQGNKWXKQHLFYR 



251 
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NNTGGTGAACTGGGCTGAGNNTGTGCCGGAGCCGGCAGTTGGGGGCGCG 
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^ FIG. 4(a) 



pro-peptide 

AAAGCTCCCACCTCCACTCGATTCAGCGATGCCATCAGAGCGTTTCAGTG 
TTTCGAGGGTGGAGGTGAGCTAAGTCGCTACGGTAGTCTCGCAAAGTCAC 
K.APTSTRFSDAIRAFQW 



GGTGTCCCAGCTACCTGTCAGCGGCGTGTTGGACCGCGCCAACCTGCGCC 
CCACAGGGTCGATGGACAGTCGCCGCACAACCTGGCGCGGTTGGACGCGG 
VSQLPVSGVLDRANLR 



AGATGACTCGTCCCCGCTGCGGGGTTACAGATACCAACAGTTATGCGGCC 
TCTACTGAGCAGGGGCGACGCCCCAATGTCTATGGTTGTCAATACGCCGG 
0 M T R P R C G V T D T N S Y A A 



TGGGCTGAGAGGATCAGTGACTTGTTTGCTAGACACCGGACCAAAATGAG 
ACCCGACTCTCCTAGTCACTGAACAAACGATCTGTGGCCTGGTTTTACTC 
WAERISDLFARHRTKMR 

catalytic domain 

GCGTAAGAAACGCTTTGCAAAGCAAGGTAACAAATGGTACAAGCAGCACC 
CGCATTCTTTGCGAAACGTTTCGTTCCATTGTTTACCATGTTCGTCGTGG 
R K K R FAKQGNKWYKQH 



TCTCCTACCGCCTGGTGAACTGGCCTGAGCATCTGCCGGAGCCGGCAGTT 
AGAGGATGGCGGACCACTTGACCGGACTCGTAGACGGCCTCGGCCGTCAA 
LSYRLVNWPEHLPEPAV 



CGGGGCGCCGTGCGCGCCGCCTTCCAGTTGTGGAGCAACGTCTCAGCGCT 
GCCCCGCGGCACGCGCGGCGGAAGGTCAACACCTCGTTGCAGAGTCGCGA 
RGAVRAAFQLWS. NVSAL 



GGAGTTCTGGGAGGCCCCAGCCACAGGCCCCGCTGACATCCGGCTCACCT 
CCTCAAGACCCTCCGGGGTCGGTGTCCGGGGCGACTGTAGGCCGAGTGGA 
EFWEAPATGPADIRLT 



TCTTCCAAGGGGACCACAACGATGGGCTGGGCAATGCCTTTGATGGCCCA 
AGAAGGTTCCCCTGGTGTTGCTACCCGACCCGTTACGGAAACTACCGGGT 
FFQGDHNDGLGNAFDGP 

GGGGGCGCCCTGGCGCACGCCTTCCTGCCCCGCCGCGGCGAAGCGCACTT 
CCCCCGCGGGACCGCGTGCGGAAGGACGGGGCGGCGCCGCTTCGCGTGAA 
GGALAHAFLPRR GEAHF 



CGACCAAGATGAGCGCTGGTCC. . .iAGCCGCCGCCGCGGGCGCAACCTGT 
GCTGGTTCTACTCGCGACCAGGGACTCGGCGGCGGCGCCCGCGTTGGACA 
DQDERWS LSRRRGRNL 
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FIG. 4(b) 

TCGTGGTGCTGGCGCACGAGATCGGTCACACGCTTGGCCTCACCCACTCG 
AGCACCACGACCGCGTGCTCTAGCCAGTGTGCGAACCGGAGTGGGTGAGC 
F V V L A HEIGHTLGLTHS 



CCCGCGCCGCGCGCGCTCATGGCGCCCTACTACAAGAGGCTGGGCCGCGA 
GGGCGCGGCGCGCGCGAGTACCGCGGGATGATGTTCTCCGACCCGGCGCT 
PAPRALMAPYYKRLGRD 



CGCGCTGCTCAGCTGGGACGACGTGCTGGCCGTGCAGAGCCTGTATGGGA 
GCGCGACGAGTCGACCCTGCTGCACGACCGGCACGTCTCGGACATACCCT 
ALLSWDDVLAVQSLYG 

hinge region 

AGCCCCTAGGGGGCTCAGTGGCCGTCCAGCTCCCAGGAAAGCTGTtCACT 
TCGGGGATCCCCCGAGTCACCGGCAGGTCGAGGGTCCTTTCGACAAGTGA 
KPLGGSVAVQLPGKLFT 



GACTTTGAGACCTGGGACTCCTACAGCCCCCAAGGAAGGCGCCCTGAAAC 
CTGAAACTCTGGACCCTGAGGATGTCGGGGGTTCCTTCCGCGGGACTTTG 
DFETWDSYSPQGRRPET 

heinopexin domain 

GCAGGGCCCTAAATACTGCCACTCTTCCTTCGATGCCATCACTGTAGACA 
CGTCCCGGGATTTATGACGGTGAGAAGGAAGCTACGGTAGTGACATCTGT 
QGPKYCHSSFDAIT.VD 



GGCAACAGCAACTGTACATTTTTAAAGGGAGCCATTTCTGGGAGGTGGCA 
CCGTTGTCGTTGACATGTAAAAATTTCCCTCGGTAAAGACCCTCCACCGT 
RQQQLYIFKGSHFWEVA 



GCTGATGGCAACGTCTCAGAGCCCCGTCCACTGCAGGAAAGATGGGTCGG 
CGACTACCGTTGCAGAGTCTCGGGGCAGGTGACGTCCTTTCTACCCAGCC 
ADGNVSEPRP. LQER WV.G 



GCTGCCCCCCAACATTGAGGCTGCGGCAGTGTCATTGAATGATGGAGATT 
CGACGGGGGGTTGTAACTCCGACGCCGTCACAGTAACTTACTACCTCTAA 
LPPNIEAAAVSLND GD 



TCTACTTCTTCAAAGGGGGTCGATGCTGGAGGTTCCGGGGCCCCTAGCCA 
AGATGAAGAAGTTTCCCCCAGCTACGACCTCCAAGGCCCCGGGGTTCGGT 
FYFFKGGRCWRFRGPKP 



GTGTGGGGTCTCCCACAGCTGTGCCGGGCAGGGGGCCTGCCCCGCCATCC 
CACACCCCAGAGGGTGTCGACACGGCCCGTCCCCCGGACGGGGCGGTAGG 
VWGLPQLCRAGGLPRHP 



TGACGCCGCCCTCTTCTTCCCTCCTCTGCGCCGCCTCATCCTCTTCAAGG 
ACTGCGGCGGGAGAAGAAGGGAGGAGACGCGGCGGAGTAGGAGAAGTTCC 
DAALFFPPLRRLILFK 
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GTGCCCGCTACTACGTGCTGGCCCGAGGGGGACTGCAAGTGGAGCCCTAC 1200 

CACGGGCGATGATGCACGACCGGGCTCCCCCTGACGTTCACCTCGGGATG 

GARYYVLARGGLQVEPY* 

TACCCCCGAAGTCTGCAGGACTGGGGAGGC ATCCCTGAGGAGGTCAGCGG 1250 
ATGGGGGCTTCAGACGTCCTGACCCCTCCGTAGGGACTCCTCCAGTCGCC 
Y PRSLQDWG 'GIPE EVSG 

CGCCCTGCCGAGGCCCGATGGCTCCATCATCTTCTTCCGAGATGACCGCT 1300 
GCGGGACGGCTCCGGGCTACCGAGGTAGTAGAAGAAGGCTCTACTGGCGA 
A L PRPDGS I I FFRDDR 



ACTGGCGCCTCGACCAGGCC AAACTGC AGGCAACCACC.TCGGGCCGCTGG 1350 
TGACCGCGGAGCTGGTCCGGTTTGACGTCCGTTGGTGGAGCCCGGCGACC 
YWRLDQAKLQA TTSGRW 



.GCCACCGAGCTGCCCTGGATGGGCTGCTGGCATGCCAACTCGGGGAGCGC 1400 
CGGTGGCTCGACGGGACCTACCCGACGACCGTACGGTTGAGCCCCTCGCG 
ATELPWMGCWHANSGSA 



CCTGTTCTGAAGGCACCTCCTCACCTCAGAAACTGGTGGTGCTCTC AGGG 1450 
GGACAAGACTTCCGTGGAGGAGTGGAGTCTTTGACCACCACGAGAGTCCC 
L F * 

CAAAATCATGTTCCCCACCCCCGGGGCAGAACCCCTCTTAGAAGCCTCTG 1500 
GTTTTAGTACAAGGGGTGGGGGCCCCGTCTTGGGGAGAATCTTCGGAGAC 

AGTCCCTCTGCAGAAGACCGGGCAGCAAAGCCTCCATCTGGAAGTCTGTC 1550 
TCAGGGAGACGTCTTCTGGCCCGTCGTTTCGGAGGTAGACCTTCAGACAG 

TGCCTTTGTTCCTTGA-^G AATGC AGC ATTGTCTTTGTCTGTCCCC ACCAC 1600 
ACGGAAACAAGGAACTTCTTACGTCGTAACAGAAACAGACAGGGGTGGTG 

ATGGAGGTGGGGGTGGGATC AATCTTAGGAAAAGCMMMGGGTCCCAG 1650 
TACCTCCACCCCCACCGTAGTTAGAATCCTTTTCGTTTTTTCCCAGGGTC 

ATCCCTTGGCCCTTTCCTCCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1700 . 

TAGGGAACCGGGAAAGGAGGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 

AAAAAAAAAAAAAAAAA 
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intlMMP MSPAPR PSRC LL LPLLTLGTAL ASLGSAQSSS 

mt2MMP MGSDPSAPGR PGWTGSLLGD REEAARPRLL PLLLVLLGCL GLGVAAEDAE 
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mc4MMP 
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MMP12 NNVL...FGE RYLEKFYGLE INKLPVTKMK YSGNU4KEKI QEMQHFLGLK 

MMP18 DYLSQYGYLQ KPLE GSNN FKPEDITEAL RAFQEASELP 

mtlMMP FS PE AWLQQYGYLP PGDLRTHTQR .SPQSLSAAI AAMQKFYGLQ 

mt2MMP VH .AE NWLRLYGYLP QPSRHMSTMR .SAQILASAL AEMQRFYGIP 

mt3MMP FN . , .VE VWLQKYGYLP PTSPRMSWR . SAETMQSAL AAMQQFYGIN 

mt4MMP ; MQQFGGLE 
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FIG. 5(h) 
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